WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 :

C07K 14/705, C12N 15/81, 1/19, C12Q
1/02



A1



(11) International Publication Number:
(43) International Publication Date: 



WO 96/13520 

9 May 1996 (09.05.96) 



(21) International Application Number: PCT/US95/14364

(22) International Filing Date: 25 October 1995 (25.10.95) 



(30) Priority Data:

08/332,312 



31 October 1994 (31.10.94) 



US 



(60) Parent Application or Grant 

(63) Related by Continuation 
US 

Filed on 



08/332,312 (CIP) 
31 October 1994 (31.10.94) 



(71) Applicant (for all designated States except US): AMERICAN
CYANAMID COMPANY [US/US]; Five Giralda Farms,
Madison, NJ 07940 (US).

(72) Inventors; and

(75) Inventors/Applicants (for US only): PAUSCH, Mark, Henry
[US/US]; 312 Andover Place, Robbinsville, NJ 08692 (US).
PRICE, Laura, Alicia [US/US]; 181 Canterbury Avenue,
Langhorne, PA 19047 (US).

(74) Agents: MATHEWS, Gale, F.; American Home Products
Corporation, Five Giralda Farms, Madison, NJ 07940 (US)
et al.



(81) Designated States: AM, AU, BB, BG, BR, BY, CA, CN, CZ,
EE, FI, GE, HU, IS, JP, KG, KP, KR, KZ, LK, LR, LT,
LV, MD, MG, MN, MX, NO, NZ, PL, RO, RU, SD, SG,
SI, SK, TJ, TM, TT, UA, US, UZ, VN, European patent
(AT, BE, CH, DE, DK, ES, FR, GB, GR, IE, IT, LU, MC,
NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA,
GN, ML, MR, NE, SN, TD, TG), ARIPO patent (KE, LS,
MW, SD, SZ, UG).



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: GENES ENCODING A FAMILY OF POTASSIUM CHANNELS 
(57) Abstract 

This invention relates generally to the potassium channel gene family. More particularly, the present invention relates to the cloning
and characterization of potassium channel genes from Drosophila melanogaster and Caenorhabditis elegans. Other aspects of the present
invention include methods of assaying substances to determine effects on cell growth. Also presented are methods of controlling nematode
and insect pests by inhibiting potassium channels substantially homologous to those encoded by nucleotide sequences as described herein.






FOR THE PURPOSES OF INFORMATION ONLY

Codes used to identify States party to the PCT on the front pages
of pamphlets publishing international applications under the PCT.

AT  Austria
AU  Australia
BB  Barbados
BE  Belgium
BF  Burkina Faso
BG  Bulgaria
BJ  Benin
BR  Brazil
BY  Belarus
CA  Canada
CF  Central African Republic
CG  Congo
CH  Switzerland
CI  Côte d'Ivoire
CM  Cameroon
CN  China
CS  Czechoslovakia
CZ  Czech Republic
DE  Germany
DK  Denmark
ES  Spain
FI  Finland
FR  France
GA  Gabon
GB  United Kingdom
GE  Georgia
GN  Guinea
GR  Greece
HU  Hungary
IE  Ireland
IT  Italy
JP  Japan
KE  Kenya
KG  Kyrgyzstan
KP  Democratic People's Republic of Korea
KR  Republic of Korea
KZ  Kazakhstan
LI  Liechtenstein
LK  Sri Lanka
LU  Luxembourg
LV  Latvia
MC  Monaco
MD  Republic of Moldova
MG  Madagascar
ML  Mali
MN  Mongolia
MR  Mauritania
MW  Malawi
NE  Niger
NL  Netherlands
NO  Norway
NZ  New Zealand
PL  Poland
PT  Portugal
RO  Romania
RU  Russian Federation
SD  Sudan
SE  Sweden
SI  Slovenia
SK  Slovakia
SN  Senegal
TD  Chad
TG  Togo
TJ  Tajikistan
TT  Trinidad and Tobago
UA  Ukraine
US  United States of America
UZ  Uzbekistan
VN  Viet Nam





32,421-01 



GENES ENCODING A FAMILY OF POTASSIUM CHANNELS 
Field of Invention 

This invention relates generally to the
potassium channel gene family. More particularly, the
present invention relates to the cloning and
characterization of potassium channel genes from
Drosophila melanogaster and Caenorhabditis elegans.

Background of the Invention 

Synthetic organic insecticides are primarily
nerve poisons acting on the cholinergic system
(organophosphorus compounds and methylcarbamates), the
voltage-gated sodium channel (pyrethroids and DDT),
and the GABA-gated chloride channel (cyclodienes and
other polychlorocycloalkanes). Potassium channels
comprise a large and diverse group of integral
membrane proteins that determine the level of
excitability and repolarization properties of neurons
and muscle fibers [B. Hille, Ionic Channels of
Excitable Membranes, Sinauer, Sunderland, MA (1984)].
The multiple essential functions encoded by the
potassium channels make them excellent targets for new
pesticides and animal and human therapeutics.
Potassium channel diversity in the fruitfly Drosophila
melanogaster results from an extended gene family
coding for homologous proteins. Six genes encoding
potassium channels have been cloned from Drosophila
melanogaster which account for a large part of the
diversity of potassium currents observed in insect
nervous tissue [A. Wei, M. Covarrubias, A. Butler, K.
Baker, M. Pak, L. Salkoff, Science 248, 599-603
(1990); N.S. Atkinson, G.A. Robertson, B. Ganetzky,
Science 253, 551-555 (1991); J. Warmke, R. Drysdale,
B. Ganetzky, Science 252, 1560-1564 (1991); A.
Bruggemann, L.A. Pardo, W. Stuhmer, O. Pongs, Nature
365, 445-448 (1993)]. Shaker and Shal encode
voltage-gated potassium channels with rapid current
activation and inactivating properties. Shab and Shaw
encode delayed rectifier channels, with slow inactivating
(Shab) and non-inactivating (Shaw) properties. Slo
encodes a calcium-activated potassium channel and eag
encodes a voltage-gated channel permeable to both
potassium and calcium which is modulated by cyclic
AMP.

Modulation of cardiac action potential by
compounds that affect the behavior of potassium
channels may be a useful treatment for serious heart
conditions. In this regard, each of the potassium
channels cloned from insects has corresponding
versions in mammalian species, including,
specifically, a delayed rectifier potassium channel
homolog, RAK, cloned from rat cardiac tissue [M.
Paulmichl, P. Nasmith, R. Hellmiss, K. Reed, W.A.
Boyle, J.M. Nerbonne, E.G. Peralta, D.E. Clapham,
Proc. Natl. Acad. Sci. USA 88, 7892-7895 (1991)].
Thus, the RAK channel represents an important target
of new drugs for the control of heart failure. The
delayed rectifier potassium current in heart cells
regulates the duration of the plateau of the cardiac
action potential by countering the depolarizing,
inward calcium current. Delayed rectifier potassium
currents characteristically are activated upon
depolarization from rest, display a sigmoidal or
delayed onset, and have a nonlinear, or rectifying,
current-voltage relation. Several types of delayed
potassium conductances have been identified in cardiac
cells based on measured single-channel conductances.
Heart rate and contractility are regulated by second
messenger modification of delayed rectifier potassium
conductances, and species differences in the shape of
the plateau may be influenced by the type and level of
channel expression.

On the basis of predicted membrane spanning
topology, potassium channels may be subdivided into
two distinct classes: voltage-gated, calcium-activated,
and cyclic nucleotide-gated potassium
channels that are composed of six membrane spanning
domains (S1-S6) and a single pore forming domain (H5),
and inward rectifying potassium channels that pass
through the membrane twice and also contain a single
pore forming region [Y. Kubo, E. Reuveny, P.A.
Slesinger, Y.N. Jan, L.Y. Jan, Nature 364, 802-806
(1993); Y. Kubo, T.J. Baldwin, Y.N. Jan, L.Y. Jan,
Nature 362, 127-133 (1993)]. Here, we report the
cloning and functional expression in yeast of a novel
Drosophila melanogaster potassium channel. Further,
we identify a Caenorhabditis elegans homolog that
constitutes the second member of a new family of
potassium channels exhibiting a topological
configuration unique among the known classes of
potassium channels.

The yeast Saccharomyces cerevisiae is
utilized as a model eukaryotic organism for the
purpose of studying potassium transport mechanisms.
Due to the ease with which one can manipulate the
genetic constitution of the yeast Saccharomyces
cerevisiae, researchers have developed a detailed
understanding of many complex biological pathways,
including potassium transport. In yeast, high
affinity potassium uptake is performed by the product
of the TRK1 gene [R.F. Gaber, C.A. Styles, G.R. Fink,
Mol. Cell. Biol. 8, 2848-2859 (1988)]. Mutant yeast
strains lacking trk1 function are incapable of growing
in medium lacking high concentrations of potassium.
Since potassium transport mechanisms are present in
organisms as divergent as yeast and man, one could
predict that expression of heterologous potassium
channels in mutant cells might replace trk1 function,
and support growth on medium containing low potassium
concentration. In this regard, plant potassium
channels were shown to function in yeast and represent
important targets for new herbicides [J.A. Anderson,
S.S. Huprikar, L.V. Kochian, W.J. Lucas, R.F. Gaber,
Proc. Natl. Acad. Sci. USA 89, 3736-3740 (1992); H.
Sentenac, N. Bonnaud, M. Minet, F. Lacroute, J.-M.
Salmon, F. Gaymard, C. Grignon, Science 256, 663-665
(1992); D.P. Schachtman and J.I. Schroeder, Nature
370, 655-658]. Thus, we have employed this yeast
expression system for cloning and expression of
potassium channels from heterologous species, making
it useful for discovery of new pesticides, and animal
and human therapeutics. Discovery of such compounds
will necessarily require screening assays of high
specificity and throughput. For example, new
pesticides directed at potassium channels require high
selectivity for insect channels and low activity
against non-insect species. Screening assays
utilizing yeast strains genetically modified to
accommodate functional expression of heterologous
potassium channels offer significant advantages in
this area.

Summary of the Invention 

A first aspect of the present invention is 
the discovery of a new subclass of potassium channel 
genes and proteins encoded thereby. Potassium 
channels belonging to this new subclass comprise four 
hydrophobic domains capable of forming transmembrane 
helices, wherein a first pore-forming domain is 
interposed between the first and second transmembrane 
helices and a second pore-forming domain is interposed 
between the third and fourth transmembrane helices, 
and wherein each pore-forming domain contains a 
potassium selective peptide motif. In preferred 
embodiments, the peptide motif is selected from the 
group consisting of a Y/F-G dipeptide motif. 
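
The topological classes discussed so far can be summarized in a small lookup. The following is only an illustrative sketch: the domain counts come from the text above (six membrane spanning domains with one pore domain, two with one, and four with two for the subclass described here), while the names `Topology`, `KNOWN_CLASSES`, and `classify_topology`, and the wording of the class labels, are hypothetical conveniences rather than terms defined in this document.

```python
from typing import NamedTuple


class Topology(NamedTuple):
    """Predicted counts of membrane-spanning and pore-forming domains."""
    transmembrane: int
    pores: int


# Class labels are illustrative shorthand, not terms defined by the text.
KNOWN_CLASSES = {
    Topology(6, 1): "six-transmembrane, one-pore (voltage-gated, "
                    "calcium-activated, cyclic nucleotide-gated)",
    Topology(2, 1): "two-transmembrane, one-pore (inward rectifier)",
    Topology(4, 2): "four-transmembrane, two-pore (subclass described here)",
}


def classify_topology(transmembrane: int, pores: int) -> str:
    """Return the topological class matching the given domain counts."""
    return KNOWN_CLASSES.get(Topology(transmembrane, pores), "unclassified")


if __name__ == "__main__":
    # Domain counts stated for the new subclass: four helices, two pores.
    print(classify_topology(4, 2))
```

Any combination of counts not listed is reported as "unclassified", which keeps the sketch from implying that other topologies are known or unknown.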

In certain preferred embodiments, the
isolation and characterization of invertebrate (i.e.
insect and nematode) potassium channel genes belonging
to this new subclass is presented. In more preferred
embodiments, the present invention provides for the
isolation of complementary DNA fragments from
Drosophila melanogaster and Caenorhabditis elegans
which encode conserved amino acid sequence elements
unique to this potassium channel gene family. A yeast
expression technology is employed to clone cDNAs from
Drosophila melanogaster and C. elegans and a
hybridization approach is utilized to isolate
additional cDNAs from Caenorhabditis elegans.

A second aspect of the present invention is
a method of assaying substances to determine effects
on cell growth. Yeast cells of the kind described
above are cultured in appropriate growth medium to
cause expression of heterologous proteins, embedded in
agar growth medium, and exposed to chemical compounds
applied to the surface of the agar plates. Effects on
the growth of embedded cells are found around
compounds that have effects on the heterologous
potassium channel.

A third aspect of the present invention is a
method of controlling nematode and insect pests by
inhibiting potassium channels substantially homologous
to those encoded by nucleotide sequences as presented
herein.

Brief Description of the Drawings

FIGURE 1. Growth of CY162 cells bearing pDmORF1.
CY162 cells transformed with plasmids isolated from
survivors of a primary library screen for plasmids
that support the growth of CY162 on medium containing low
potassium concentration. Six individual transformants
of each plasmid-bearing strain are cultured in patches
on the indicated medium. CY162 cells bearing pDmORF1
are found in the upper left-hand corner of each plate
while pKAT1-containing cells are found in the lower
right-hand corner.

FIGURE 2A and 2B. DNA sequence and deduced amino acid
sequence of Dm ORF1 [SEQ ID NOS: 1 and 2]. The
nucleotide sequence of the 2.4 kb cDNA revealed a
single long open reading frame proximal to the GAL1
promoter. Segments corresponding to putative
transmembrane (M1-M4) and pore-forming H5 domains in
the predicted polypeptide are underlined. The single
amino-terminal asparagine-linked glycosylation site is
indicated by a G.

FIGURE 3A and 3B. DNA sequence and deduced amino acid
sequence of the F22b7.7 segment of the Caenorhabditis
elegans genome [SEQ ID NO: 3]. Segments corresponding
to putative transmembrane (M1-M4) and pore-forming H5
domains in the predicted polypeptide are underlined.

FIGURE 4. Alignment of DmORF1 and F22b7.7 sequences.
Protein-coding regions of DmORF1 [SEQ ID NO: 37] and
F22b7.7 [SEQ ID NO: 38] (designated as CeORF-1 in this
FIGURE) are compared using the protein sequence
alignment algorithm in Geneworks DNA sequence analysis
software. Identical amino acids are boxed.

FIGURE 5A. Comparison of the pore-forming domains of
DmORF1 and F22b7.7. Amino acid sequences from the
six cloned Drosophila melanogaster potassium channels
and three inward rectifier channels [SEQ ID NOS: 7
through 21] are compared to DmORF1 and F22b7.7 within
the pore-forming H5 regions. Amino acid identities
are indicated by a vertical line and conserved
substitutions indicated by a dot. Amino acid
substitutions deemed acceptable are indicated.

FIGURE 5B. Hydropathy plot analysis of the DmORF1 and
F22b7.7 polypeptide sequence. The Kyte-Doolittle
hydropathy algorithm in the Geneworks DNA analysis
software is used to predict the topology of DmORF1 and
F22b7.7. The position of predicted membrane spanning
domains (M1-M4) and pore-forming domains are
indicated.

FIGURE 6. Predicted membrane spanning topology of
DmORF1.

FIGURE 7. Heterologous potassium channel-dependent
growth of plasmid bearing CY162 (trk1Δ) strains.
CY162 bearing pYES2, pKAT1, pDmORF1, and pRATRAK are
cultured at 30°C for four days on arginine phosphate
agar medium containing 0 mM, 0.2 mM, or 100 mM added
KCl.

FIGURE 8. Inhibition of growth of yeast cells
containing heterologous potassium channels. CY162
cells (10^5) bearing the indicated plasmids are plated
in arginine phosphate agar medium containing 0.2 mM
potassium chloride. Sterile filter disks were placed
on the surface of the agar and saturated with 20 ml of
a 1 M solution of potassium channel blocking compound.
Clockwise from upper left-hand corner is BaCl2, CsCl,
TEA, and RbCl. KCl is applied to the center disk.

FIGURE 9A and 9B. DNA sequence and deduced amino acid
sequence of CORK [SEQ ID NO: 36]. The nucleotide
sequence of the 1.4 kb cDNA revealed a single long
open reading frame proximal to the GAL1 promoter.
Segments corresponding to pore-forming H5 domains in
the predicted polypeptide are underlined. Asparagine-linked
glycosylation sites are indicated by a G.



Detailed Description of the Invention 

Nucleotide bases are abbreviated herein as follows:
Ade; A-Adenine          G-Guanine          Ura; U-Uracil
C-Cytosine              T-Thymine

Amino acid residues are abbreviated herein to either three
letters or a single letter as follows:
Ala; A-Alanine          Leu; L-Leucine
Arg; R-Arginine         Lys; K-Lysine
Asn; N-Asparagine       Met; M-Methionine
Asp; D-Aspartic acid    Phe; F-Phenylalanine
Cys; C-Cysteine         Pro; P-Proline
Gln; Q-Glutamine        Ser; S-Serine
Glu; E-Glutamic acid    Thr; T-Threonine
Gly; G-Glycine          Trp; W-Tryptophan
His; H-Histidine        Tyr; Y-Tyrosine
Ile; I-Isoleucine       Val; V-Valine
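
For readers who handle sequences programmatically, the abbreviation table above maps directly onto a dictionary. This is a minimal sketch of one way to convert three-letter residue names to the single-letter form; the names `THREE_TO_ONE` and `to_one_letter` are hypothetical and are not part of this document.

```python
# Three-letter to single-letter amino acid codes, as listed in the table.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(residues):
    """Convert an iterable of three-letter codes to a one-letter string.

    Input case is normalized, so "GLY", "gly", and "Gly" are equivalent.
    An unrecognized code raises KeyError rather than being guessed.
    """
    return "".join(THREE_TO_ONE[r.capitalize()] for r in residues)


if __name__ == "__main__":
    print(to_one_letter(["Gly", "Tyr", "Gly"]))
```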

The term "mammalian" as used herein refers to any mammalian
species (e.g., human, mouse, rat, and monkey).

The term "heterologous" as used herein refers to DNA
sequences, proteins, and other materials originating from
organisms other than the organism used in the expression of
the potassium channels or portions thereof, or described
herein (e.g., mammalian, avian, amphibian, insect, plant),
or combinations thereof not naturally found in yeast.

The terms "upstream" and "downstream" are used herein
to refer to the direction of transcription and translation,
with a sequence being transcribed or translated prior to
another sequence being referred to as "upstream" of the
latter.

The potassium channels of the present invention possess
properties in common with known potassium channels
including voltage-gated channels, calcium-activated
channels, cyclic nucleotide-gated channels, inward rectifier
channels, and the like, and especially with regard to
electrophysiological properties. Certain preferred channels
exhibit inward and outward currents that are affected by
potassium concentration, particularly characteristic of
voltage-gated channels. The term "channel", and the
nucleotide sequences encoding same, are intended to encompass
subtypes of the aforementioned classes of channels, and
mutants, derivatives and homologs thereof.

The nucleotide sequences encoding the potassium
channels or parts thereof may be expressed recombinantly,
and utilized for a variety of reasons, the most notable of
which is for screening of substances that modulate the
activity of the potassium ion channels. Such substances,
especially inhibitors of the activity of the potassium
channels of the present invention, may be utilized as
insecticides, anthelmintics, drugs suitable for the
control of heart failure, and the like.

Heterologous DNA sequences are typically expressed in a
host by means of an expression vector. An expression vector
is a replicable DNA construct in which a DNA sequence
encoding the heterologous DNA sequence is operably linked to
suitable control sequences capable of effecting the
expression of a protein or protein subunit coded for by the
heterologous DNA sequence in the intended host. Generally,
control sequences include a transcriptional promoter, an
optional operator sequence to control transcription, a
sequence encoding suitable mRNA ribosomal binding sites, and
(optionally) sequences which control the termination of
transcription and translation. Vectors useful for
practicing the present invention include plasmids, viruses
(including bacteriophage), and integratable DNA fragments
(i.e., fragments integratable into the host genome by
genetic recombination). The vector may replicate and
function independently of the host genome, as in the case of
a plasmid, or may integrate into the genome itself, as in
the case of an integratable DNA fragment. Suitable vectors
will contain replicon and control sequences which are
derived from species compatible with the intended expression
host. For example, a promoter operable in a host cell is
one which binds the RNA polymerase of that cell, and a
ribosomal binding site operable in a host cell is one which
binds the endogenous ribosomes of that cell.

DNA regions are operably associated when they are
functionally related to each other. For example, a promoter
is operably linked to a coding sequence if it controls the
transcription of the sequence; a ribosome binding site is
operably linked to a coding sequence if it is positioned so
as to permit translation. Generally, operably linked means
contiguous and, in the case of leader sequences, contiguous
and in reading phase.

Transformed host cells of the present invention are
cells which have been transformed or transfected with the
vectors constructed using recombinant DNA techniques and
express the protein or protein subunit coded for by the
heterologous DNA sequences. In preferred embodiments, the
transformed host cells are yeast. A variety of yeast
cultures, and suitable expression vectors for transforming
yeast cells, are known. See, e.g., U.S. Patent No.
4,745,057; U.S. Patent No. 4,797,359; U.S. Patent No.
4,615,974; U.S. Patent No. 4,880,734; U.S. Patent No.
4,711,844; and U.S. Patent No. 4,865,989. Saccharomyces
cerevisiae is the most commonly used among the yeasts,
although a number of other yeast species are commonly
available. See, e.g., U.S. Patent No. 4,806,472
(Kluyveromyces lactis and expression vectors therefor);
4,855,231 (Pichia pastoris and expression vectors
therefor). A heterologous potassium channel may permit a
yeast strain unable to grow in medium containing low
potassium concentration to survive [CY162, for example, see
J.A. Anderson, S.S. Huprikar, L.V. Kochian, W.J. Lucas, R.F.
Gaber, Proc. Natl. Acad. Sci. USA 89, 3736-3740 (1992)].
Yeast vectors may contain an origin of replication from the
endogenous 2 micron (2µ) yeast plasmid or an autonomously
replicating sequence (ARS) which confer on the plasmid the
ability to replicate at high copy number in the yeast cell,
centromeric (CEN) sequences which limit the plasmid to
replication at only low copy number in the yeast
cell, a promoter, DNA encoding the heterologous DNA
sequences, sequences for polyadenylation and transcription
termination, and a selectable marker gene. An exemplary
plasmid is YRp7 [Stinchcomb et al., (1979) Nature 282, 39;
Kingsman et al., (1979) Gene 7, 141; Tschumper et al.,
(1980) Gene 10, 157]. This plasmid contains the TRP1 gene,
which provides a selectable marker for a mutant strain of
yeast lacking the ability to grow in the absence of tryptophan,
for example ATCC No. 44076. The presence of the trp1 lesion
in the yeast host cell genome then provides an effective
environment for detecting transformation by growth in the
absence of tryptophan.

Suitable promoting sequences in yeast vectors include
the promoters for metallothionein (YEp52),
3-phosphoglycerate kinase [pPGKH, Hitzeman et al., (1980) J.
Biol. Chem. 255, 2073] or other glycolytic enzymes [pYSK153,
Hess et al., (1968) J. Adv. Enzyme Reg. 7, 149; and Holland
et al., (1978) Biochemistry 17, 4900], such as enolase,
glyceraldehyde-3-phosphate dehydrogenase, hexokinase,
pyruvate decarboxylase, phosphofructokinase,
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate
kinase, triosephosphate isomerase, phosphoglucose isomerase,
and glucokinase. Suitable vectors and promoters for use in
yeast expression are further described in R. Hitzeman et
al., EPO Publn. No. 73,657. Other promoters, which have the
additional advantage of transcription controlled by growth
conditions, are the promoter regions for alcohol
dehydrogenase 2 (pAD4M), isocytochrome C, acid phosphatase,
degradative enzymes associated with nitrogen metabolism, and
the aforementioned metallothionein and
glyceraldehyde-3-phosphate dehydrogenase, as well as enzymes responsible for
maltose and galactose (pYES2) utilization. Finally, in
constructing suitable expression plasmids, the termination
sequences associated with these genes may also be ligated
into the expression vector 3' of the heterologous coding
sequences to provide polyadenylation and termination of the
mRNA.

In one embodiment of the present invention, a yeast
expression system is described, wherein yeast cells bear
heterologous potassium channels. In preferred embodiments,
these channels are DmORF-1, CORK, or RAK. As noted above,
transformed host cells of the present invention express the
protein or protein subunit coded for by the heterologous
DNA sequences. When expressed, the potassium channel is
located in the host cell membrane (i.e., physically
positioned therein in proper orientation for both the
stereoselective binding of ligands and passage of potassium
ions).

In certain preferred screening embodiments of the
present invention, a transformed yeast cell is presented,
containing a heterologous DNA sequence which codes for a rat
cardiac delayed rectifier potassium channel, RAK, cloned
into a suitable expression vector. RAK is capable of
complementing the potassium-dependent phenotype of
Saccharomyces cerevisiae strain CY162 on medium containing
low potassium concentration.

The potassium channel subclass of the present invention
is characterized in that the potassium channels have four
hydrophobic domains capable of forming transmembrane
helices. These channels are further characterized in that
they comprise two pore-forming domains, one of which is
interposed between said first helix and said second helix,
and the other of which is interposed between said third
helix and said fourth helix. The pore-forming domains
further contain a potassium selective motif which serves to
confer upon the channel the ability to pass potassium ions
to the exclusion of other ions, such as sodium, calcium, and
the like. In certain preferred embodiments, this motif
contains the peptide Y/G, and particularly in either a
dipeptide or tripeptide motif, and frequently with Y/F-G
bonding. In most preferred embodiments, the motif is
selected from the group consisting of G-V-G, G-L-G, G-Y-G,
G-F-G, and G-I-G.
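
The tripeptide motifs listed above lend themselves to a simple pattern search. The sketch below is offered only as an illustration of how one might scan a one-letter protein sequence for G-V-G, G-L-G, G-Y-G, G-F-G, and G-I-G; the function name `find_selectivity_motifs` and the test strings are hypothetical, and a match on its own does not establish that a region is a pore-forming domain.

```python
import re
from typing import List, Tuple

# Lookahead so that overlapping occurrences (e.g. "GYGFG") are all reported.
_MOTIF = re.compile(r"(?=(G[VLYFI]G))")


def find_selectivity_motifs(sequence: str) -> List[Tuple[int, str]]:
    """Return (1-based position, motif) for each G-[V/L/Y/F/I]-G tripeptide."""
    return [(m.start() + 1, m.group(1))
            for m in _MOTIF.finditer(sequence.upper())]


if __name__ == "__main__":
    # Made-up fragment for demonstration; not a sequence from this document.
    for position, motif in find_selectivity_motifs("TTIGYGDLAAGFGN"):
        print(position, motif)
```

Positions are reported 1-based to match the usual convention for numbering amino acid residues.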

In certain embodiments of the present invention, the
potassium channel is positioned within a cell membrane in
such a manner as to allow it to function as a modulator of
the flow of potassium ions into and out of the cell. To
best regulate this activity, at least one pore-forming
domain may be positioned proximal to an exterior portion of
the cell membrane.

In other embodiments, the potassium channels of the
present invention further comprise an amino-terminal
glycosylation site, and especially wherein that site is
asparagine-linked.

Potassium channels belonging to the subclass as
presented herein may be derived from a wide variety of
animal species, both vertebrate and invertebrate. Using the
yeast expression technology and other teachings as set forth
herein, the present inventors have isolated a single 2463
base pair cDNA fragment from an invertebrate source,
designated Dm ORF1 [SEQ ID NO: 1], by complementation of the
potassium-dependent phenotype of Saccharomyces cerevisiae
strain CY162 (trk1Δ) on medium containing low potassium
concentration [J.A. Anderson, S.S. Huprikar, L.V. Kochian,
W.J. Lucas, R.F. Gaber, Proc. Natl. Acad. Sci. USA 89,
3736-3740 (1992)]. Dm ORF1 contains a single long open reading
frame encoding a protein of 618 amino acids [SEQ ID NO: 2]
that exhibits substantial amino acid identity to the
pore-forming regions of other potassium channels. The DmORF1
contains structural features that distinguish it from other
classes of potassium channels, including four hydrophobic
domains capable of forming transmembrane helices (M1-M4) and
two putative pore forming H5 domains found between
transmembrane helices M1 and M2, and M3 and M4. Each pore
forming H5 domain contains the Y/F-G dipeptide motif
required for potassium selectivity [L. Heginbotham, T.
Abramson, R. MacKinnon, Science 258, 1152-1155 (1992)].
This work was expanded to clone a construct derived from C.
elegans having a single open reading frame sufficient to
encode a protein of 434 amino acids, designated pCORK.
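
Hydrophobic domains of the kind described above are typically located with a sliding-window hydropathy scan. The following sketch uses the published Kyte-Doolittle scale, the algorithm named in the description of FIGURE 5B; it is not the Geneworks implementation, and the function names, the default window of 19 residues, and the 1.6 cutoff are assumptions drawn from common practice for membrane-spanning segments rather than from this document.

```python
# Kyte-Doolittle hydropathy index for each one-letter amino acid code.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=19):
    """Mean hydropathy of every full window along the sequence.

    Element i covers residues i .. i + window - 1 (0-based). A sequence
    shorter than the window yields an empty list.
    """
    scores = [KYTE_DOOLITTLE[aa] for aa in sequence.upper()]
    return [sum(scores[i:i + window]) / window
            for i in range(len(scores) - window + 1)]


def hydrophobic_windows(profile, threshold=1.6):
    """Start indices of windows whose mean hydropathy meets the cutoff."""
    return [i for i, value in enumerate(profile) if value >= threshold]


if __name__ == "__main__":
    # Artificial sequence: a leucine-rich stretch between charged flanks.
    demo = "DDDDDKKKKK" + "L" * 20 + "KKKKKDDDDD"
    profile = hydropathy_profile(demo)
    print(hydrophobic_windows(profile))
```

Runs of consecutive indices returned by `hydrophobic_windows` mark candidate membrane-spanning segments; deciding where one segment ends and the next begins is left to the reader, since that step depends on the analysis software used.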

A search of the GENBANK database for DNA and protein
sequences similar to DmORF1 revealed several cloned
potassium channel sequences including a putative protein
coding DNA sequence, F22b7.7, reported in the Caenorhabditis
elegans genome sequencing project [R. Wilson, R. Ainscough,
K. Anderson, et al., Nature 368, 32-38 (1994)]. The DNA
sequence contained a single long open reading frame
sufficient to encode a protein of 336 amino acids (predicted
MW 38.5 kDa) with substantial homology to known potassium
channel sequences.

Using the hybridization approach, a cDNA sequence
designated CeORF1 [SEQ ID NO: 38] was isolated by probing a
Caenorhabditis elegans cDNA library with oligonucleotides
designed using F22b7.7 DNA sequences [T.N. Davis and J.
Thorner, Meth. Enzymol. 139, 246-262 (1987)]. CeORF1
contains a single long open reading frame encoding a protein
that exhibits substantial amino acid identity to
pore-forming regions of other potassium channels.

CeORF1 and pCORK each contain structural features
similar to DmORF1, including two putative pore forming H5
domains. Each pore forming H5 domain contains the Y/F-G
dipeptide motif required for potassium selectivity [L.
Heginbotham, T. Abramson, R. MacKinnon, Science 258,
1152-1155 (1992)]. These features form the basis of the
designation of a new sub-family of potassium channels
comprising DmORF1, CORK, and CeORF1.

Other aspects of the present invention relate to
methods of modulating potassium channel activity, by
affecting the ability of such channel to allow the flow of
ions into, through, or out of a cellular membrane, and
particularly when these ions are potassium ions. Certain
substances, whether biological or chemical in nature, may be
applied to cell membranes having as an integral part of
their structure one or more potassium channels comprising
the amino acid sequences of SEQ ID NO: 2, SEQ ID NO: 4, SEQ
ID NO: 36, or RAK, in an amount and for a time sufficient to
affect the ability of the potassium channel to so regulate
the flow of ions. Substances that are potassium channel
blockers will inhibit the ability of the channel to regulate
the flow of such ions. Substances that enhance such ability
may be considered potassium channel "activators."
Substances that modulate the activity of RAK may do so by
modulation of cardiac action potential, upward or downward.

Application of such substances may take the form of in
vitro, ex vivo, or in vivo application, each in a
formulation suitable to deliver the substance to the cell
membrane and to sustain such delivery for a time sufficient
to allow the substance to interact with the membrane.
Appropriate formulations, concentrations of substances,
application time, and other relevant parameters may be
established by utilizing, inter alia, known assays for
measuring ion channel current flow. Another suitable
endpoint one skilled in the art may utilize in optimizing
these parameters, especially in the case of potassium
channel blockers, is "cell death". Such assays may be
performed in vitro and extrapolated to in vivo conditions,
or in some cases may be easily established directly in vivo,
for example by applying the substance directly to a test
sample comprising the target insect pest (whole organism)
and noting the parameters at which an acceptable
percentage of insect death is attained.

In certain preferred embodiments, methods of
selectively inhibiting insect pests are presented by
applying to such insect pests a substance capable of
selectively inhibiting the activity of a potassium channel
contained in the cells of such insect, and comprising the
amino acid sequence of SEQ ID NO: 2, or a potassium channel
substantially homologous thereto. In the most preferred
embodiments, the inhibitor will inhibit the activity of the
aforementioned potassium channel without inhibition of
other, non-homologous potassium channels that may be present
in species other than the targeted insect pest. It is
envisioned that such other species may also be present at
the site of application of the inhibitor, such as in a
garden, crop, or other site wherein it is desired to control
insect pests. In other preferred embodiments, methods of
selectively inhibiting nematode pests are presented much in
the same manner as discussed for control of insect pests, by
applying to such pests a substance capable of selectively
inhibiting the activity of a potassium channel contained in
the cells of such pest, and comprising the amino acid
sequence of SEQ ID NO: 4 or SEQ ID NO: 36, or potassium
channels substantially homologous thereto.

The following Examples are provided to further 
illustrate various aspects of the present invention. They 
are not to be construed as limiting the invention. 






Example 1 

Recombinant expression library screening.
Saccharomyces cerevisiae strain CY162 is described in
[Anderson, J. A. et al. (1992) Proc. Natl. Acad. Sci. USA 89,
3736-3740]. Growth of bacterial strains and plasmid
manipulations are performed by standard methods (Maniatis
T., Molecular Cloning. Cold Spring Harbor Laboratory Press,
1982). Media conditions for growth of yeast, isolation of
plasmid DNA from yeast, and DNA-mediated transformation of
yeast strains are as described (Rose M. D., Methods in yeast
genetics, Cold Spring Harbor Laboratory Press, 1990). A
multifunctional expression library constructed in pYES2 and
containing cDNA made from 3rd instar male Drosophila
melanogaster mRNA is used as described [S.J. Elledge, J.T.
Mulligan, S.W. Ramer, M. Spottswood, R.W. Davis Proc. Natl.
Acad. Sci. USA 88, 1731-1735 (1991)]. A multifunctional
expression library constructed in pYES2 and containing cDNA
made from mRNA obtained from all life stages of
Caenorhabditis elegans is custom-made by Invitrogen
Corporation.

Isolation of expression plasmids encoding heterologous
potassium channels. CY162 cells are transformed with
plasmid DNA from each library to give 3 x 10^6 transformants
from each library on SCD-ura agar medium (synthetic complete
dextrose (2 %) medium containing all necessary nutritional
supplements except uracil) containing 0.1 M KCl.
Transformants are replica-plated to SCG-ura (synthetic
complete galactose (2 %) medium containing all necessary
nutritional supplements except uracil) agar medium.
Colonies that grow on this selective agar medium are
transferred to SCG-ura agar medium to obtain single-colony
clones while reassaying suppression of the
potassium-dependent phenotype. Plasmid DNA is isolated from
surviving colonies and used to transform CY162. Six individual
transformant strains containing one plasmid, pDmORF1, that
confers the potassium-independent phenotype are cultured on
SCD-ura and SCG-ura medium along with CY162 strains bearing
pKAT1, which encodes a plant inward rectifier potassium
channel that supports the growth of CY162 on selective
medium (FIGURE 1). The plasmid-bearing strains exhibit
potassium-independent growth on both dextrose- and
galactose-containing medium. Growth on dextrose is likely due
to a basal level of transcription leading to sufficient
potassium channel expression to support growth.

Example 2 

DNA sequence analysis of DmORF1. Plasmids that confer
suppression of the potassium-dependent phenotype are
subjected to automated DNA sequence analysis performed by
high temperature cycle sequencing (Applied Biosystems).
Geneworks DNA sequence analysis software (Intelligenetics)
is used to align raw DNA sequence information and to
identify open reading frames. The DNA sequence of the 2.4
kb insert in pDmORF1 is displayed in FIGURE 2A and 2B [SEQ
ID NO: 1]. The 5' untranslated sequences of the cDNA contain
long poly A and poly T tracts not likely to be found in
protein coding regions. The first ATG proximal to the 5'
end is present in a consensus Drosophila melanogaster
translational initiation site [D.R. Cavener Nucleic Acids
Res., 15, 1353-1361 (1987)], consistent with the designation
of this site as the translational start site. A single long
open reading frame sufficient to encode a protein of 618
amino acids (predicted MW 68 kDa) is encoded in pDmORF1. A
consensus polyadenylation site, AATCAA, occurs at position
2093-2098 in 3' untranslated sequences. DmORF1 contains
structural features that distinguish it from other classes
of potassium channels, including four hydrophobic domains
capable of forming transmembrane helices (M1-M4) and two
pore forming H5 domains found between transmembrane helices
M1 and M2, and M3 and M4. Each pore forming H5 domain
contains the Y/F-G dipeptide motif required for potassium
selectivity [L. Heginbotham, T. Abramson, R. MacKinnon,
Science 258, 1152-1155, (1992)].
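
The ORF length and predicted molecular weight quoted above can be
cross-checked with a few lines of Python. This is a minimal sketch,
not part of the described method: the CDS coordinates (190..2043)
are those given for SEQ ID NO: 1 in the sequence listing, the
average residue mass of 110 Da is a common rule of thumb assumed
here for illustration, and the function names are hypothetical.

```python
# Rule-of-thumb average mass of one amino acid residue, in daltons.
# This is an assumption for illustration, not a value from the application.
AVG_RESIDUE_MASS_DA = 110.0


def codons_in_cds(start: int, end: int) -> int:
    """Number of codons in a CDS given 1-based inclusive coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


def approx_mass_kda(n_residues: int) -> float:
    """Approximate protein mass in kDa from the residue count."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


if __name__ == "__main__":
    n = codons_in_cds(190, 2043)
    print(n, "codons, about", round(approx_mass_kda(n)), "kDa")
```

For the coordinates above this gives 618 codons and roughly 68 kDa,
in line with the figures stated in the text. The estimate is coarse
and will deviate for proteins of unusual composition.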


Example 3 

Identification of Caenorhabditis elegans sequences
homologous to DmORF1. A search of the GENBANK database for
protein sequences similar to DmORF1 reveals significant
matches with several known potassium channel sequences. The
closest match is to a putative protein coding DNA sequence,
F22b7.7, reported in the Caenorhabditis elegans genome
sequencing project [R. Wilson, R. Ainscough, K. Anderson, et
al., Nature 368, 32-38 (1994)]. The DNA sequence and
predicted amino acid sequence assembled from putative exons
recognized by a GENBANK exon identification algorithm are
displayed in FIGURE 3A and 3B [SEQ ID NOS: 3 and 4]. The DNA
sequence contains a single long open reading frame
sufficient to encode a protein of 336 amino acids (predicted
MW 38.5 kDa) with substantial homology to known potassium
channel sequences. The F22b7.7 sequence contains structural
features that distinguish it from other classes of potassium
channels, including three of the four hydrophobic domains
capable of forming transmembrane helices (M1-M4) identified
in DmORF1 and two pore forming H5 domains, one found between
a predicted M1 and M2 and one between M3 and M4.
Each pore forming H5 domain contains the Y/F-G dipeptide
motif required for potassium selectivity [L. Heginbotham, T.
Abramson, R. MacKinnon, Science 258, 1152-1155, (1992)].
The lack of an amino terminal transmembrane domain
homologous to DmORF1 M1 in the F22b7.7 sequence may be due
to failure of the search algorithm to identify exon(s)
encoding the amino terminus. Alternatively, an amino
terminal coding sequence may be added by trans-splicing,
which occurs frequently in Caenorhabditis elegans.

Example 4 

Cloning and DNA sequence analysis of CeORF1.

Oligonucleotides corresponding to DNA sequences encoding the
two pore forming domains of F22b7.7 are synthesized using an
Applied Biosystems DNA synthesizer.

F22b7.7-H2-1:
5' TCCATTTTCTTTGCCGTAACCGTCGTCACTACCATCGGATACGGTAATCCA [SEQ ID NO: 5]

F22b7.7-H2-2:
5' TCATTCTACTGGTCCTTCATTACAATGACTACTGTCGGGTTTGGCGACTTG [SEQ ID NO: 6]

The oligos are labelled at their 5' ends with
32P using a 5'-end labelling kit according to the manufacturer's
instructions (New England Nuclear). The labelled oligos are
pooled and used to screen 6 x 10^5 plaques from a λZAP
Caenorhabditis elegans cDNA library (obtained from
Clontech) by published methods [T.N. Davis and J. Thorner
Meth. Enzymol. 139, 246-262 (1987)]. Hybridization is at
42°C for 16 hours. Positive clones are plaque-purified by
twice repeating the hybridization screening process.
Plasmid DNAs, excised from phage DNA according to the
manufacturer's instructions, are subjected to automated DNA
sequence analysis performed by high temperature cycle
sequencing (Applied Biosystems). Geneworks DNA sequence
analysis software (Intelligenetics) is used to align raw DNA
sequence data and to identify open reading frames.
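
To illustrate what the two probes encode, the following Python
sketch translates each oligonucleotide with the standard genetic
code and looks for the G-Y/F-G pore motif. It is a hedged example
only: reading each probe in frame from its first base is an
assumption (the application does not state the frame), and the
helper names are hypothetical.

```python
import re

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built in TCAG order for each codon position.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Probe sequences as given in Example 4 (SEQ ID NOS: 5 and 6).
PROBES = {
    "F22b7.7-H2-1": "TCCATTTTCTTTGCCGTAACCGTCGTCACTACCATCGGATACGGTAATCCA",
    "F22b7.7-H2-2": "TCATTCTACTGGTCCTTCATTACAATGACTACTGTCGGGTTTGGCGACTTG",
}

# G-Y-G or G-F-G, the selectivity motif discussed in the text.
PORE_MOTIF = re.compile(r"G[YF]G")


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def find_pore_motifs(protein: str) -> list[tuple[int, str]]:
    """Return (1-based position, motif) for each G-[YF]-G match."""
    return [(m.start() + 1, m.group()) for m in PORE_MOTIF.finditer(protein)]


if __name__ == "__main__":
    for name, dna in PROBES.items():
        peptide = translate(dna)
        print(name, peptide, find_pore_motifs(peptide))
```

Under the frame-1 assumption, the first probe translates to a
peptide containing GYG and the second to one containing GFG, which
matches the description of the probes as covering the two pore
forming domains.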

Example 5 

Comparison of the putative proteins encoded by DmORF1 and
F22b7.7. Predicted amino acid sequences of DmORF1 and
F22b7.7 are aligned and displayed in FIGURE 4 [SEQ ID NOS: 37
and 38]. Only limited overall amino acid homology is
exhibited by these two proteins, with regions of greatest
homology existing in the pore forming H2-1 and H2-2 domains.
FIGURE 5A shows a comparison of the pore forming domains of
DmORF1 and F22b7.7 with those of the known Drosophila
melanogaster potassium channel and inward rectifier
sequences [SEQ ID NOS: 7 through 21]. Amino acid identities
greater than 50 % are observed with all potassium channel
sequences. FIGURE 5B shows hydropathy plot analysis of
DmORF1 and F22b7.7. The two proteins, which show remarkable
topological similarity through their length, are predicted
to be composed of four membrane-spanning hydrophobic domains
(M1-M4) and two pore forming H2 domains. These data
suggest the predicted topology shown in FIGURE 6. Both
proteins are predicted to span the membrane four times with
amino and carboxyl termini residing within the cell. This
topology places the single amino-terminal asparagine-linked
glycosylation site and H2 domains on the cell exterior,
permitting permeation of the membrane by the pore forming
domains from the outside, an absolute requirement for the
formation of a functional potassium channel.
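
A hydropathy plot of the kind referred to for FIGURE 5B can be
sketched as a sliding-window average over a hydrophobicity scale.
The application does not say which scale or window was used, so the
Kyte-Doolittle scale and the 19-residue window below are assumptions,
as are the function and variable names. The test sequence is the
first 29 residues of DmORF1 as read from the SEQ ID NO: 1 listing.

```python
# Kyte-Doolittle hydropathy values (assumed scale; not named in the text).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Residues 1-29 of DmORF1, taken from the sequence listing.
DMORF1_NTERM = "MSPNRWILLLIFYISYLMFGAAIYYHIEH"


def hydropathy_profile(seq: str, window: int = 19) -> list[float]:
    """Mean hydropathy of every full window along the sequence."""
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and len(seq)")
    scores = [KD[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


if __name__ == "__main__":
    profile = hydropathy_profile(DMORF1_NTERM)
    peak = max(profile)
    start = profile.index(peak) + 1  # 1-based start of the best window
    print(f"peak mean hydropathy {peak:.2f} for window starting at {start}")
```

With this scale, window averages above roughly 1.6 are commonly read
as candidate membrane-spanning segments; the N-terminal stretch
exceeds that value, which is consistent with a hydrophobic domain
near the amino terminus.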

Example 6 

Functional expression of a rat atrial delayed rectifier
potassium channel in yeast. CY162 transformants containing
plasmids pKAT1, which encodes a plant inward rectifier
potassium channel, pRATRAK, which encodes a rat atrial
delayed rectifier potassium channel, pDmORF1, and control
plasmid pYES2 are cultured on arginine-phosphate-dextrose
agar medium lacking ura [A. Rodriguez-Navarro and J.
Ramos, J. Bacteriol. 159, 940-945, (1984)] containing
various KCl concentrations (FIGURE 7). Strains containing
pKAT1, pRATRAK, and pDmORF1 all support the growth of CY162
on medium containing a low concentration of potassium, while
pYES2-containing CY162 cells grow only on medium containing
a high potassium concentration, indicating that heterologous
potassium channels of several different types function to
provide high affinity potassium uptake.

pRATRAK is constructed by modifying the protein-coding
sequences of RATRAK to add 5' HindIII and 3' XbaI sites
using PCR. In addition, four A residues are added to the
sequences immediately 5' proximal to the initiator ATG to
provide a good yeast translational initiation site. The
modified fragment is cloned into the HindIII and XbaI sites
in the yeast expression vector pYES2 (Invitrogen), forming
pRATRAK.

Example 7 

Bioassay of functional expression of heterologous potassium
channels

Yeast strains dependent on heterologous potassium
channels for growth should be sensitive to non-specific
potassium channel blocking compounds. To test the potassium
channel blocking properties of several compounds, a
convenient agar plate bioassay is employed. Strains
containing pKAT1, pRATRAK, pDmORF1, and pYES2 are plated in
arginine-phosphate-dextrose agar medium lacking ura and
containing various amounts of potassium chloride.
Arginine-phosphate-dextrose medium is used to avoid
interference from potassium and ammonium ions present in
standard synthetic yeast culture medium. Sterile filter disks
are placed on the surface of the agar and saturated with the
potassium channel blocking ions CsCl, BaCl2, and TEA. The
growth of heterologous potassium channel containing strains is
inhibited by potassium channel blocking ions in a
channel-dependent manner. DmORF1-dependent growth is blocked by
BaCl2 but not by CsCl or TEA. KAT-dependent growth is
blocked by BaCl2, CsCl, and TEA. RATRAK-dependent growth is
blocked by BaCl2, CsCl, and TEA to a much greater extent than
that of pKAT1-containing cells, reflecting in part a slower
growth rate of pRATRAK-containing cells. These observations
confirm that these channels support the growth of the mutant
yeast cells and demonstrate the efficacy of the yeast bioassay
for screening for compounds that block potassium channel
function. The control pYES-containing strain grows only around
applied KCl and RbCl, a congener of KCl.

Example 8 

Identification of compounds that alter potassium channel
activity

Yeast strains made capable of growing on medium
containing low potassium concentration by expression of
heterologous potassium channels are used to screen
libraries of chemical compounds of diverse structure for
those that interfere with channel function. CY162 cells
containing pKAT1, pRATRAK, pDmORF1, pCeORF1, and pYES2-TRK1
(lOYml) are plated in 200 ml of arginine-phosphate-dextrose
agar medium lacking ura and containing 0.2 mM potassium
chloride in 500 cm2 plates. The CY162 cells bearing
pYES2-TRK1 are included in the assay as a control to identify
compounds that have non-specific effects on the yeast strain
and are therefore not specifically active against the
heterologous potassium channels. Samples of chemical
compounds of diverse structure (2 µl of 10 mg/ml solution in
DMSO) are applied to the surface of the hardened agar medium
in a 24 x 24 array. The plates are incubated for 2 days at
30°C, during which time the applied compounds diffuse
radially into the agar medium. The effects of applied
compounds on strains bearing heterologous potassium channel
genes are compared to the pYES2-TRK1 bearing strain.
Compounds that cause a zone of growth inhibition around the
point of application that is larger on plates containing
cells bearing the heterologous potassium channels than that
observed around the pYES2-TRK1 bearing strains are
considered selective potassium channel blockers. Compounds
that induce a zone of enhanced growth around the point of
application that is larger on plates containing cells
bearing the heterologous potassium channels than that
observed around the pYES2-TRK1 bearing strains are
considered selective potassium channel openers.
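
The comparison rule described in this Example can be written down as
a small decision function. The sketch below is illustrative only:
the record layout, the zone measurements in millimetres, the
optional margin parameter, and the compound names are all
hypothetical; only the 24 x 24 array, the 2 µl of 10 mg/ml
application, and the larger-zone-than-control criterion come from
the text.

```python
from dataclasses import dataclass

SPOTS_PER_PLATE = 24 * 24            # 24 x 24 array from the text
DOSE_UG_PER_SPOT = 2 * 10            # 2 µl of a 10 mg/ml solution = 20 µg


@dataclass
class ZoneReading:
    """Zone diameters (mm) for one compound; field names are hypothetical."""
    compound: str
    channel_inhibition_mm: float      # plate with heterologous channel strain
    control_inhibition_mm: float      # plate with pYES2-TRK1 control strain
    channel_enhancement_mm: float = 0.0
    control_enhancement_mm: float = 0.0


def classify(reading: ZoneReading, margin_mm: float = 0.0) -> str:
    """Apply the larger-than-control rule; margin_mm is an assumed tolerance."""
    if reading.channel_inhibition_mm > reading.control_inhibition_mm + margin_mm:
        return "selective blocker"
    if reading.channel_enhancement_mm > reading.control_enhancement_mm + margin_mm:
        return "selective opener"
    return "not selective"


if __name__ == "__main__":
    readings = [
        ZoneReading("cmpd-A", 12.0, 0.0),
        ZoneReading("cmpd-B", 8.0, 8.0),
        ZoneReading("cmpd-C", 0.0, 0.0, channel_enhancement_mm=6.0),
    ]
    for r in readings:
        print(r.compound, "->", classify(r))
```

A compound that inhibits the control strain just as strongly as the
channel-dependent strain falls into "not selective", which mirrors
the role of the pYES2-TRK1 strain as a filter for non-specific
effects.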

Example 9 

DmORF1-induced currents in X. laevis oocytes assayed by
two-electrode voltage clamp

DNA sequence analysis of the pDmORF1 insert strongly
suggests that the protein encoded by the single long ORF
possesses properties in common with known potassium
channels. To test this hypothesis, the electrophysiological
properties of the putative potassium channel encoded by
DmORF1 were examined by expression in X. laevis oocytes.
Currents were measured by two-electrode whole-cell voltage
clamp. DNA sequences encoding the open reading frame of
DmORF1 were amplified by polymerase chain reaction (PCR)
using the following oligonucleotides:

MP023: ATAAAGCTTAAAAATGTCGCCGAATCGATGGAT [SEQ ID NO: 22]
MP024: AGCTCTAGACCTCCATCTGGAAGCCCATGT [SEQ ID NO: 23]

The full length PCR product was cloned into corresponding
sites in pSP64 poly A (Promega), forming pMP147. Template
DNA was linearized with EcoRI and RNA transcribed using the
Message Machine (Ambion) in vitro transcription kit
according to the manufacturer's instructions. A sample of the
RNA was resolved in a MOPS-acetate-formaldehyde agarose gel
and RNA content was estimated by ethidium bromide staining.
The remainder was stored on dry ice. X. laevis oocytes were
isolated and injected with 50 nl of sterile TE containing
5-20 ng transcript according to published procedures. After
three days, whole oocyte currents were recorded using a
two-electrode voltage clamp. Electrodes contained 3 M KCl and
had resistances of 0.3-1.0 MΩ. Recordings were performed
with constant perfusion at room temperature in the presence
of either low (10 mM) or high (90 mM) potassium chloride.

Two-electrode voltage clamp analysis of the DmORF1 gene
product expressed in X. laevis oocytes demonstrated
properties of a voltage- and potassium-dependent potassium
channel. At low potassium concentration, DmORF1 exhibited
outward current at depolarizing potentials. At high
potassium concentration, DmORF1 exhibited both inward and
outward currents. The DmORF1 channel displayed a high
preference for potassium and showed cation selectivity in the
rank order K>Rb>NH4>Cs>Na>Li. Potassium currents were
greatly attenuated by BaCl2.
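
One way to see why the two bath concentrations behave differently is
to estimate the potassium equilibrium potential with the Nernst
equation. The sketch below is not from the application: the
intracellular potassium concentration of 100 mM and the temperature
of 22 °C are assumptions chosen for illustration, and the function
name is hypothetical.

```python
import math

R_GAS = 8.314462618      # gas constant, J/(mol*K)
FARADAY = 96485.33212    # Faraday constant, C/mol


def nernst_mv(conc_out_mm: float, conc_in_mm: float,
              temp_c: float = 22.0, valence: int = 1) -> float:
    """Equilibrium potential in millivolts for one ion species."""
    temp_k = temp_c + 273.15
    return 1000.0 * R_GAS * temp_k / (valence * FARADAY) * math.log(
        conc_out_mm / conc_in_mm
    )


if __name__ == "__main__":
    K_IN_MM = 100.0  # assumed intracellular K+, not stated in the text
    for k_out in (10.0, 90.0):  # bath concentrations from Example 9
        print(f"{k_out:.0f} mM outside: E_K = {nernst_mv(k_out, K_IN_MM):.1f} mV")
```

Under these assumptions the equilibrium potential is strongly
negative in 10 mM potassium and close to zero in 90 mM, so a
potassium-selective channel would be expected to pass mainly outward
current on depolarization in low potassium and both inward and
outward current in high potassium, as reported.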






Example 10 



Developmental regulation of DmORF1 expression in D.
melanogaster determined by northern blotting analysis

Isolation of pDmORF1 from a D. melanogaster expression
library strongly suggests that the insert contained within
originated in mRNA from that species. Detailed
understanding of the developmental regulation of DmORF1
expression should aid in determining strategies for use of
DmORF1 as a target for novel insecticides. To characterize
DmORF1 expression, northern blotting analysis of poly A+ RNA
from various stages of the D. melanogaster life cycle was
carried out.

D. melanogaster poly A+ RNA from embryo, larvae and
adult forms (Invitrogen, 5 µg) was resolved in a
MOPS-acetate-formaldehyde agarose gel according to standard
procedures. The gel was stained with ethidium bromide and
photographed to mark the positions of 18S and 28S
ribosomal RNAs used as molecular weight markers. RNA was
transferred by capillary action to nitrocellulose with 10x
SSPE. The blot was air-dried, baked for one hour at 80°C,
and prehybridized in 4x SSPE, 1% SDS, 2x Denhardt's, 0.1 %
single stranded DNA at 68°C for 2 hours.

A 2.4 kb XhoI fragment of DmORF1 was isolated from
pDmORF1 and labeled with α-32P dCTP using the Ready-to-Go
kit (Pharmacia) according to the manufacturer's instructions.
The probe was denatured by heating to 100°C for 5 minutes
followed by quenching in an ice water bath. The probe was
added to the prehybridization solution and hybridization
continued for 24 hours at 68°C.

The blot was washed briefly with 2x SSPE, 0.1% SDS at
room temperature followed by 0.5x SSPE, 0.1% SDS at 65°C
for 2 hours. The blot was air-dried and exposed to
Reflection X-ray film (NEN) using an intensifying screen at
-70°C for 48 hours.

Northern blotting analysis indicates that the DmORF1
probe hybridizes to an mRNA species of approximately 2.8 kb
isolated from D. melanogaster embryo, larvae, and adult
forms. The length of the DmORF1 mRNA corresponds well with
the length of the predicted ORF. Thus, DmORF1 is
expressed at all developmental stages in the life cycle of
D. melanogaster.

Example 11

Expression of the DmORF1 gene product in vitro.

DNA sequence analysis of the pDmORF1 insert reveals a
single long ORF with conserved amino acid sequence domains
in common with known potassium channels. The DNA sequence
predicts an ORF sufficient to encode a protein of 618 amino
acids in length. The DmORF1 polypeptide contains four
segments of at least 20 hydrophobic amino acids in length,
suggesting that the segments span the plasma membrane. In
addition, the DmORF1 protein sequence contains a putative
N-linked glycosylation site (Asn-Thr-Thr) at amino acids
58-60. To confirm that a protein of the predicted size of
DmORF1 is expressed from the insert in pDmORF1 and to test
the proposition that DmORF1 is glycosylated, pDmORF1 was
used as template to drive coupled in vitro
transcription/translation.

Plasmid pMP147 was used as template to produce
35S-labeled DmORF1 gene product in vitro using a TnT coupled
transcription-translation kit (Promega) according to the
manufacturer's instructions. Glycosylation of the nascent
DmORF1 polypeptide was accomplished by addition of canine
pancreatic microsomes (Promega) to the
transcription-translation reaction. Samples of glycosylated
DmORF1 protein were treated with endoglycosidase H to remove
added carbohydrate moieties. Aliquots were precipitated with TCA
and collected on GF/C filters, washed with ethanol, dried
and counted. Equivalent cpm's were resolved by SDS-PAGE.
The gel was impregnated with soluble fluor Amplify
(Amersham) and dried onto Whatman 3 MM paper. The dried gel
was exposed to Reflection X-ray film at room temperature.

Translation of the DmORF1 gene product in vitro
produced a polypeptide of 68 kDa, consistent with the
predicted molecular weight of the ORF. Translation of
DmORF1 in the presence of canine pancreatic microsomes
results in synthesis of a protein with reduced
electrophoretic mobility, consistent with glycosylation of
the nascent polypeptide. Treatment of glycosylated DmORF1
with EndoH increased its relative mobility as expected upon






removal of carbohydrate moieties. Thus, the pDmORF1 insert
is capable of directing the expression of a glycoprotein
with the expected molecular weight. EndoH treatment removes
carbohydrate residues consistent with the sugar added
through N-linked glycosylation.

Example 12 

High-affinity K+ uptake and selectivity of DmORF1 expressed
in yeast.

Expression of DmORF1 permits CY162 cells to grow on
medium containing a low concentration of potassium, implying
that DmORF1 supplies high affinity potassium uptake
capacity. To characterize the potassium uptake properties
of CY162 cells containing DmORF1, 86Rb uptake studies were
performed. Examination of the uptake of this potassium
congener revealed important aspects of potassium uptake by
DmORF1.

Yeast strains containing heterologous potassium channel
expression plasmids, CY162-DmORF1 and CY162-pKAT, and the control
strain CY162-pYES2 (Clontech) were cultured overnight in SC
Gal-ura containing 0.1 M KCl. The cells were harvested,
washed with sterile double-distilled water and starved for
K+ for 6 hours in Ca-MES buffer. Cells were washed again
and distributed to culture tubes (10 e cells/tube) containing
86RbCl in Ca-MES buffer. The tubes were incubated at room
temperature, and samples were filtered at various time
intervals and counted. 86Rb uptake into cells was displayed.
For Double Reciprocal analysis, 86Rb was held constant and
barium ions varied to determine Ki values.

The high-affinity potassium uptake capacity encoded by
DmORF1 permits high-affinity uptake of the potassium
congener, 86Rb, as well. Barium inhibited 86Rb uptake with a






Ki of pM as demonstrated in Double Reciprocal analysis. No
high affinity 86Rb uptake is observed in control CY162-pYES2
cells and 86Rb uptake into CY162-pKAT cells is consistent
with its published properties.
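
A reciprocal analysis of this kind, with tracer held constant and
inhibitor varied, can be sketched as a straight-line fit of 1/uptake
against inhibitor concentration. The application does not give its
fitting procedure or data, so everything below is an assumption for
illustration: the simple inhibition model v = v0 / (1 + [I]/Ki),
valid when tracer is far below its Km, the synthetic data points,
and the function names.

```python
def linear_fit(xs: list[float], ys: list[float]) -> tuple[float, float]:
    """Ordinary least-squares slope and intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_ki(inhibitor_conc: list[float], uptake: list[float]) -> float:
    """Ki from 1/v = (1/v0) * (1 + [I]/Ki), i.e. intercept / slope."""
    reciprocal = [1.0 / v for v in uptake]
    slope, intercept = linear_fit(inhibitor_conc, reciprocal)
    return intercept / slope


if __name__ == "__main__":
    # Synthetic, noise-free data generated with v0 = 100 and Ki = 50
    # (arbitrary units); these are NOT measurements from the application.
    true_ki, v0 = 50.0, 100.0
    barium = [0.0, 25.0, 50.0, 100.0, 200.0]
    uptake = [v0 / (1.0 + i / true_ki) for i in barium]
    print(f"recovered Ki = {estimate_ki(barium, uptake):.1f}")
```

With real uptake counts the fit would be noisy, and a different
inhibition mechanism or a tracer concentration near Km would change
how the intercept relates to Ki.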


Example 13 

Expression of Drosophila melanogaster potassium channels in
yeast.

Voltage-gated potassium channel diversity in the
fruitfly Drosophila melanogaster is encoded in large part by
six genes, Shaker, Shab, Shal, Shaw, Eag, and Slo.
Expression of these potassium channels in yeast will permit
their introduction into screening assays for novel
insecticidal compounds and facilitate characterization of
their ion channel properties and sensitivity to compounds
with activating and inhibitory properties.

DNA sequences encoding Drosophila melanogaster
potassium channels were amplified by PCR using synthetic
oligonucleotides that add 5' HindIII or KpnI sites and 3'
XbaI, SphI, or XhoI sites:

Shaker 5': AAAAAGCTTAAAATGGCACACATCACG [SEQ ID NO: 24]
Shaker 3': AAACTCGAGTCATACCTGTGGACT [SEQ ID NO: 25]

Shab 5': AAAAAGCTTAAAATGGTCGGGCAATTG [SEQ ID NO: 26]
Shab 3': AAAAGCATGCTCATCTGGATGGGCA [SEQ ID NO: 27]

Shal 5': AAAAAGCTTAAAATGGCCTCGGTCGCC [SEQ ID NO: 28]
Shal 3': TTTTCTAGACTACATCGTTGTCTT [SEQ ID NO: 29]

Shaw 5': AAAAAGCTTAAAATGAATCTGATCAAC [SEQ ID NO: 30]
Shaw 3': AAATCTAGATTAGTCGAAACTGAA [SEQ ID NO: 31]

Eag 5': AAAAAGCTTAAAATGCCTGGCGGA [SEQ ID NO: 32]
Eag 3': AAATCTAGAGGCTACAGGAAGTCC [SEQ ID NO: 33]

Slo 5': GGGGGTACCAAAATGTCGGGGTGTGAT [SEQ ID NO: 34]
Slo 3': TTTTTCTAGATCAAGAGTTATCATC [SEQ ID NO: 35]
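
The cloning sites carried by each primer can be confirmed by
scanning for the enzymes' recognition sequences. The following
Python sketch does this for the twelve primers listed above; the
recognition sequences are the standard ones for these enzymes, while
the dictionary layout and function name are hypothetical.

```python
# Standard recognition sequences for the enzymes named in the text.
SITES = {
    "HindIII": "AAGCTT",
    "KpnI": "GGTACC",
    "XbaI": "TCTAGA",
    "SphI": "GCATGC",
    "XhoI": "CTCGAG",
}

# Primer sequences as listed in Example 13 (SEQ ID NOS: 24-35).
PRIMERS = {
    "Shaker 5'": "AAAAAGCTTAAAATGGCACACATCACG",
    "Shaker 3'": "AAACTCGAGTCATACCTGTGGACT",
    "Shab 5'": "AAAAAGCTTAAAATGGTCGGGCAATTG",
    "Shab 3'": "AAAAGCATGCTCATCTGGATGGGCA",
    "Shal 5'": "AAAAAGCTTAAAATGGCCTCGGTCGCC",
    "Shal 3'": "TTTTCTAGACTACATCGTTGTCTT",
    "Shaw 5'": "AAAAAGCTTAAAATGAATCTGATCAAC",
    "Shaw 3'": "AAATCTAGATTAGTCGAAACTGAA",
    "Eag 5'": "AAAAAGCTTAAAATGCCTGGCGGA",
    "Eag 3'": "AAATCTAGAGGCTACAGGAAGTCC",
    "Slo 5'": "GGGGGTACCAAAATGTCGGGGTGTGAT",
    "Slo 3'": "TTTTTCTAGATCAAGAGTTATCATC",
}


def sites_in(primer: str) -> list[str]:
    """Names of the listed enzymes whose site occurs in the primer."""
    return [name for name, site in SITES.items() if site in primer]


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name:10s} {', '.join(sites_in(seq))}")
```

Each primer carries exactly one of the five sites, and the 5'
primers place the site immediately upstream of an A-rich stretch and
the initiator ATG, in keeping with the cloning strategy described
for pRATRAK in Example 6.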

Plasmids used as templates for the PCR reactions were:
pBSc-DShakerH37, pBSc-dShab11, pBSc-dShal2+(A)3S,
pBScMXT-dShaw [A. Wei, M. Covarrubias, A. Butler, K. Baker,
M. Pak, L. Salkoff, Science 248, 599-603 (1990), provided by L.
Salkoff], pBScMXT-slo,v4 [N.S. Atkinson, G.A. Robertson, B.
Ganetzky, Science 253, 551-555, (1991), provided by L.
Salkoff], and pBIMCH20 Eag [CH20] [J. Warmke, R. Drysdale,
B. Ganetzky, Science 252, 1560-1564 (1991), A. Bruggemann,
L.A. Pardo, W. Stuhmer, O. Pongs, Nature 365, 445-448
(1993), provided by B. Ganetzky].

Amplified fragments were digested with the appropriate
restriction endonucleases, purified using GeneClean (Bio
101), and ligated into corresponding sites in pYES2
(Invitrogen). CY162 cells were transformed with assembled
Drosophila melanogaster potassium channel expression
plasmids by the LiCl method and plated on SCD-ura agar
medium containing 0.1 M KCl. Selected transformants were tested
for growth on arginine-phosphate-galactose (2 %)/sucrose
(0.2 %)-ura agar medium containing 1-5 mM KCl. CY162 cells
containing pKAT1 or pDmORF1 were cultured as positive
controls and CY162 cells containing pYES2 were grown to
provide a negative control.

CY162 cells bearing Drosophila melanogaster potassium
channel expression plasmids survive under conditions in
which growth is dependent on functional potassium channel
expression. At potassium ion concentrations between 1-3 mM,
negative control CY162 cells containing pYES2 grow poorly.
Expression of the Drosophila melanogaster potassium channels






Shal, Shaw and Eag substantially improves growth of CY162.
These results are consistent with the Drosophila
melanogaster potassium channels providing high-affinity
potassium uptake capacity. This capacity is apparently
sufficient to replace the native high-affinity potassium
transport capacity encoded by TRK1, which is lacking in CY162
(trk1 trk2) cells.

Example 14 



Cloning of a novel C. elegans sequence with homology to
potassium channels.

In order to expand the applicability of this technology
to discover compounds with novel anthelmintic activity,
CY162 cells were transformed with a pYES2-based yeast
expression library constructed using cDNA synthesized from
C. elegans mRNA (Invitrogen). Plasmid DNA isolated from
yeast cells that survived the selection scheme described in
EXAMPLE 1 was subjected to automated DNA sequence analysis
performed by high temperature cycle sequencing (Applied
Biosystems). Geneworks DNA sequence analysis software
(Intelligenetics) is used to align raw DNA sequence
information and to identify open reading frames. The DNA
sequence of the 1.4 kb insert in pCORK is displayed in
FIGURE 9A and 9B. The 5' untranslated sequences of the cDNA
are present in this construct. A single long open reading
frame sufficient to encode a protein of 434 amino acids
(predicted MW 48 kDa) is predicted in pCORK [SEQ ID NO: 38].
A consensus polyadenylation site, AATAAA, occurs at position
1359-1364 in 3' untranslated sequences and is followed by a
tract of 15 consecutive A residues. The CORK ORF contains
structural features that resemble pore forming H5 domains
found in potassium channels. Two putative pore forming H5
domains (residues 76-39 and 150-162) contain the G-Y/F-G
tripeptide motif required for potassium selectivity [L.
Heginbotham, T. Abramson, R. MacKinnon, Science 258, 1152-
1155, (1992)].

SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: American Cyanamid Company 

(ii) TITLE OF INVENTION: Genes Encoding a Novel Family of 
Potassium Channels 

(iii) NUMBER OF SEQUENCES : 38 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: American Cyanamid Company 

(B) STREET: One Cyanamid Plaza 

(C) CITY: Wayne 

(D) STATE: New Jersey

(E) COUNTRY: USA

(F) ZIP: 07470-8426

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(viii) ATTORNEY / AGENT INFORMATION: 
(A) NAME: Matthews, Gale F.
(B) REGISTRATION NUMBER: 32,369
(C) REFERENCE/DOCKET NUMBER: 32,421-01 PCT



(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 201-660-6329 

(B) TELEFAX: 201-660-7160 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 190..2043

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ACGCGATCGC CGCGAGTGTA TATTTTTTTT TTAGCTCAGT CTTCAGTGTT TCGCGATTCT 60 

CTTTAAAAGA AAAAAAAAAT AATAAGTCAA AACTACAAAC CACACAGCGA AAGGCGAAAG 120 

CAACGGTTCC TGCGAGTGTT TATTTTTTTT TTCAACAATT TTTGATCGTA GTGCGACAAT 180 

CCGTCGAGC ATG TCG CCG AAT CGA TGG ATC CTG CTG CTC ATC TTC TAC 228
Met Ser Pro Asn Arg Trp Ile Leu Leu Leu Ile Phe Tyr
1 5 10

ATA TCC TAC CTG ATG TTC GGG GCG GCA ATC TAT TAC CAT ATT GAG CAC 276
Ile Ser Tyr Leu Met Phe Gly Ala Ala Ile Tyr Tyr His Ile Glu His
15 20 25

GGC GAG GAG AAG ATA TCG CGC GCC GAA CAG CGC AAG GCG ??? ATT GCA 324
Gly Glu Glu Lys Ile Ser Arg Ala Glu Gln Arg Lys Ala Gln Ile Ala
30 35 40 45

ATC AAC GAA TAT CTG CTG GAG GAG CTG GGC GAC AAG AAT A?? ??? ??? 372
Ile Asn Glu Tyr Leu Leu Glu Glu Leu Gly Asp Lys Asn Thr Thr Thr
50 55 60

CAG GAT GAG ATT CTT CAA CGG ATC TCG GAT TAG TGT GAC AAA CCG GTT 420
Gln Asp Glu Ile Leu Gln Arg Ile Ser Asp Tyr Cys Asp Lys Pro Val
65 70 75

ACA TTG CCG CCG ACA TAT GAT GAT ACG CCC TAC ACG TGG ACC TTC TAC 468
Thr Leu Pro Pro Thr Tyr Asp Asp Thr Pro Tyr Thr Trp Thr Phe Tyr
80 85 90

??? ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? 516
His Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly
95 100 105

AAT ATA TCG CCA ACC ACC TTC GCC GGA CGG ATG ATC ATG ATC GCG ??? 564
Asn Ile Ser Pro Thr Thr Phe Ala Gly Arg Met Ile Met Ile Ala Tyr
110 115 120 125

TCG GTG ATT GGC ATC CCC GTC AAT GGT ATC CTC TTT GCC GGC CTC GGC 612
Ser Val Ile Gly Ile Pro Val Asn Gly Ile Leu Phe Ala Gly Leu Gly
130 135 140

GAA TAC TTT GGA CGT ACG TTT GAA GCG ATC TAC AGA CGC TAC AAA AAG 660
Glu Tyr Phe Gly Arg Thr Phe Glu Ala Ile Tyr Arg Arg Tyr Lys Lys
145 150 155

TAC AAG ATG TCC ACG GAT ATG CAC TAT GTC ??? ??? ??? ??? ??? ??? 708
Tyr Lys Met Ser Thr Asp Met His Tyr Val Pro Pro Gln Leu Gly Leu
160 165 170

ATC ACC ACG GTG GTG ATT GCC CTG ATT CCG GG? ATA GCT CTC TTC CTG 756
Ile Thr Thr Val Val Ile Ala Leu Ile Pro Gly Ile Ala Leu Phe Leu
175 180 185

GTG CTG CCC TGC GTG GGT GTT CAC CTA CTT CGA ??? ??? ??? ??? ??? 804
Val Leu Pro Cys Val Gly Val His Leu Leu Arg Glu Leu Gly Leu Ser
190 195 200 205

TCC ATC TCG CTG TAC TAC AGC TAT GTG ACC ACC ACA AC? AT? GG? TTC 852
Ser Ile Ser Leu Tyr Tyr Ser Tyr Val Thr Thr Thr Thr Ile Gly Phe
210 215 220

GGT GAC TAT GTG CCC ACA TTT GGA GCC AAC CAG CCC AAG GAG TTC GGC 900
Gly Asp Tyr Val Pro Thr Phe Gly Ala Asn Gln Pro Lys Glu Phe Gly
225 230 235

GGC TGG TTC GTG GTC TAT CAG ATC TT? GTG ATC GTG TGG TTC ATC TTC 948
Gly Trp Phe Val Val Tyr Gln Ile Phe Val Ile Val Trp Phe Ile Phe
240 245 250

TCG CTG GGA TAT CTT GTG ATG ATC ATG ACA TTT ATC ACT CGG ??? ??? 996
Ser Leu Gly Tyr Leu Val Met Ile Met Thr Phe Ile Thr Arg Gly Leu
255 260 265

CAG AGC AAG AAG CTG GGA TAC CTG GAG CAG ??? ??? ??? ??? ??? ??? 1044
Gln Ser Lys Lys Leu Ala Tyr Leu Glu Gln Gln Leu Ser Ser Asn Leu
270 275 280 285

AAG GCC ACA CAG AAT CGC ATC TGG TCT GGC GTC ACC AAG GAT ??? ??? 1092
Lys Ala Thr Gln Asn Arg Ile Trp Ser Gly Val Thr Lys Asp Val Gly
290 295 300

TAC CTC CGG CGA ATG CTC AAC GAG CTG TAC ATC CTC AAA GTG AAG CCT 1140
Tyr Leu Arg Arg Met Leu Asn Glu Leu Tyr Ile Leu Lys Val Lys Pro
305 310 315

GTG TAC ACC GAT GTA GAT ATC GCC TAC ACA CTG CCA CGT TCC AAT TCG 1188 
Val Tyr Thr Asp Val Asp He Ala Tyr Thr Leu Pro Arg Ser Asn Ser 
320 325 330 

TGT CCG GAT CTG AGC ATG TAC CGC GTG GAG CCG GCT CCC ATT CCC AGC 1236 
Cys Pro Asp Leu Ser Met Tyr Arg Val Glu Pro Ala Pro He Pro Ser 
335 340 345 

CGG AAG AGG GCA TTC TCC GTG TGC GCC GAC ATG GTT GGC GCC CAA AGG 1284 
Arg Lys Arg Ala Phe Ser Val Cys Ala Asp Met Val Gly Ala Gin Arg 
350 355 360 365 

GAG GCG GGC ATG GTA CAC GCC AAT TCC GAT ACG GAT CTA ACC AAA CTG 1332
Glu Ala Gly Met Val His Ala Asn Ser Asp Thr Asp Leu Thr Lys Leu
370 375 380 

GAT CGC GAG AAG ACA TTC GAG ACG GCG GAG GCG TAC CAC CAG ACC ACC 1380
Asp Arg Glu Lys Thr Phe Glu Thr Ala Glu Ala Tyr His Gln Thr Thr
385 390 395 

GAT TTG CTG GCC AAG GTG GTC AAC GCA CTG GCC ACG GTG AAG CCA CCG 1428 
Asp Leu Leu Ala Lys Val Val Asn Ala Leu Ala Thr Val Lys Pro Pro 
400 405 410 

CCG GCG GAA CAG GAA GAT GCG GCT CTC TAT GGT GGC TAT CAT GGC TTC 1476 
Pro Ala Glu Gin Glu Asp Ala Ala Leu Tyr Gly Gly Tyr His Gly Phe 
415 420 425 

TCC GAC TCC CAG ATC CTG GCC AGC GAA TGG TCG TTC TCG ACG GTC AAC 1524 
Ser Asp Ser Gln Ile Leu Ala Ser Glu Trp Ser Phe Ser Thr Val Asn
430 435 440 445 

GAG TTC ACA TCA CCG CGA CGT CCA AGA GCA CGT GCC TGC TCC GAT TTC 1572 
Glu Phe Thr Ser Pro Arg Arg Pro Arg Ala Arg Ala Cys Ser Asp Phe 
450 455 460 

AAT CTG GAG GCA CCT CGC TGG CAG AGC GAG AGG CCA CTG CGT TCG AGC 1620
Asn Leu Glu Ala Pro Arg Trp Gln Ser Glu Arg Pro Leu Arg Ser Ser
465 470 475 

CAC AAC GAA TGG ACA TGG AGC GGC GAC AAC CAG CAG ATC CAG GAG GCA 1668
His Asn Glu Trp Thr Trp Ser Gly Asp Asn Gln Gln Ile Gln Glu Ala
480 485 490 



TTC AAC CAG CGC TAC AAG GGA CAG CAG CGT GCC AAC GGA GCA GCC AAC 1716
Phe Asn Gln Arg Tyr Lys Gly Gln Gln Arg Ala Asn Gly Ala Ala Asn
495 500 505

TCG ACC ATG GTC CAT CTG GAG CCG GAT GCT TTG GAG GAG CAG CTG AGA 1764
Ser Thr Met Val His Leu Glu Pro Asp Ala Leu Glu Glu Gln Leu Arg
510 515 520 525

AAC AAT CAC CGG GTG CCG GTC GCG TCA AGA AGT TCT CCA TGC CGG ATG 1812 
Asn Asn His Arg Val Pro Val Ala Ser Arg Ser Ser Pro Cys Arg Met 
530 535 540 

GTC TGC GAC GTC TGT TTC CCT TCC AGA AGA AGC ACC CCT CGC AGG ATC 1860 
Val Cys Asp Val Cys Phe Pro Ser Arg Arg Ser Thr Pro Arg Arg He 
545 550 555 

TGG AGC GCA AGT TGT CCG TGG TCT CGG TAC CCG AGG GTG TCA TCT CGC 1908 
Trp Ser Ala Ser Cys Pro Trp Ser Arg Tyr Pro Arg Val Ser Ser Arg
560 565 570 

AGG AAG CCA GAT CCC CGC TGG ACT ACT ACA TCA ACA CGG TCA CGG CGG 1956
Arg Lys Pro Asp Pro Arg Trp Thr Thr Thr Ser Thr Arg Ser Arg Arg
575 580 585

CCT CCA GTC AAT CCT ATT TGC GCA ACG GAC GCG GTC CGC CAC CGC CCT 2004
Pro Pro Val Asn Pro Ile Cys Ala Thr Asp Ala Val Arg His Arg Pro
590 595 600 605

TCG AAT CGA ATG GCA GCT TGG CCA GCG GCG GCG GCG GGC TAACGAACAT 2053
Ser Asn Arg Met Ala Ala Trp Pro Ala Ala Ala Ala Gly
610 615

GGGCTTCCAG ATGGAGGATG GAGCAACCCC GCCATCGGCA TTGGGCGGTG GAGCCTATCA 2113
ACGCAAGGCG GCTGCTGGCA AGCGCCGACG CGAGAGCATC TACACCCAGA ATCAAGCCCC 2173
ATCCGCTCGC CGGGGCAGCA TGTATCCGCC GACCGCGCAC GCCTTGGCCC AGATGCAGAT 2233
GCGACGCGGC AGCTTGGCAA CCAGTGGCTC TGGATCGGCG GCCATGGCGG CAGTGGCCGC 2293
GCGTCGTGGC AGCCTCTTCC CAGCTACAGC ATCGGCATCA TCGCTGACCT CTGCTCCGCG 2353
CCGAAGCAGC ATATTCTCGG TTACCTCCGA AAAGGATATG AATGTGCTGG AGCAGACGAC 2413
CATTGCGGAT CTGATTCGTG CGCTCGAG 2441

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 618 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ser Pro Asn Arg Trp Ile Leu Leu Leu Ile Phe Tyr Ile Ser Tyr
1 5 10 15

Leu Met Phe Gly Ala Ala Ile Tyr Tyr His Ile Glu His Gly Glu Glu
20 25 30

Lys Ile Ser Arg Ala Glu Gln Arg Lys Ala Gln Ile Ala Ile Asn Glu
35 40 45

Tyr Leu Leu Glu Glu Leu Gly Asp Lys Asn Thr Thr Thr Gln Asp Glu
50 55 60

Ile Leu Gln Arg Ile Ser Asp Tyr Cys Asp Lys Pro Val Thr Leu Pro
65 70 75 80

Pro Thr Tyr Asp Asp Thr Pro Tyr Thr Trp Thr Phe Tyr His Ala Phe
85 90 95

Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn Ile Ser
100 105 110

Pro Thr Thr Phe Ala Gly Arg Met Ile Met Ile Ala Tyr Ser Val Ile
115 120 125

Gly Ile Pro Val Asn Gly Ile Leu Phe Ala Gly Leu Gly Glu Tyr Phe
130 135 140

Gly Arg Thr Phe Glu Ala Ile Tyr Arg Arg Tyr Lys Lys Tyr Lys Met
145 150 155 160

Ser Thr Asp Met His Tyr Val Pro Pro Gln Leu Gly Leu Ile Thr Thr
165 170 175

Val Val Ile Ala Leu Ile Pro Gly Ile Ala Leu Phe Leu Val Leu Pro
180 185 190

Cys Val Gly Val His Leu Leu Arg Glu Leu Gly Leu Ser Ser Ile Ser
195 200 205 

Leu Tyr Tyr Ser Tyr Val Thr Thr Thr Thr He Gly Phe Gly Asp Tyr 
210 215 220 

Val Pro Thr Phe Gly Ala Asn Gln Pro Lys Glu Phe Gly Gly Trp Phe
225 230 235 240 

Val Val Tyr Gin lie Phe Val lie Val Trp Phe lie Phe Ser Leu Gly 
245 250 255 

Tyr Leu Val Met He Met Thr Phe lie Thr Arg Gly Leu Gin Ser Lys 
260 265 270 

Lys Leu Ala Tyr Leu Glu Gin Gin Leu Ser Ser Asn Leu Lys Ala Thr 
275 280 285 

Gin Asn Arg He Trp Ser Gly Val Thr Lys Asp Val Gly Tyr Leu Arg 
290 295 300 

Arg Met Leu Asn Glu Leu Tyr He Leu Lys Val Lys Pro Val Tyr Thr 
305 310 315 320 

Asp Val Asp He Ala Tyr Thr Leu Pro Arg Ser Asn Ser Cys Pro Asp 
325 330 335 

Leu Ser Met Tyr Arg Val Glu Pro Ala Pro lie Pro Ser Arg Lys Arg 
340 345 350 

Ala Phe Ser Val Cys Ala Asp Met Val Gly Ala Gin Arg Glu Ala Gly 
355 360 365 

Met Val His Ala Asn Ser Asp Thr Asp Leu Thr Lys Leu Asp Arg Glu
370 375 380 

Lys Thr Phe Glu Thr Ala Glu Ala Tyr His Gln Thr Thr Asp Leu Leu
385 390 395 400 

Ala Lys Val Val Asn Ala Leu Ala Thr Val Lys Pro Pro Pro Ala Glu 
405 410 415 

Gin Glu Asp Ala Ala Leu Tyr Gly Gly Tyr His Gly Phe Ser Asp Ser 
420 425 430 

Gin He Leu Ala Ser Glu Trp Ser Phe Ser Thr Val Asn Glu Phe Thr 
435 440 445 

Ser Pro Arg Arg Pro Arg Ala Arg Ala Cys Ser Asp Phe Asn Leu Glu 
450 455 460 

Ala Pro Arg Trp Gin Ser Glu Arg Pro Leu Arg Ser Ser His Asn Glu 
465 470 475 480 

Trp Thr Trp Ser Gly Asp Asn Gin Gin He Gin Glu Ala Phe Asn Gin 
485 490 495 

Arg Tyr Lys Gly Gin Gin Arg Ala Asn Gly Ala Ala Asn Ser Thr Met 
500 505 510 

Val His Leu Glu Pro Asp Ala Leu Glu Glu Gin Leu Arg Asn Asn His 
515 520 525 

Arg Val Pro Val Ala Ser Arg Ser Ser Pro Cys Arg Met Val Cys Asp
530 535 540

Val Cys Phe Pro Ser Arg Arg Ser Thr Pro Arg Arg Ile Trp Ser Ala
545 550 555 560

Ser Cys Pro Trp Ser Arg Tyr Pro Arg Val Ser Ser Arg Arg Lys Pro
565 570 575

Asp Pro Arg Trp Thr Thr Thr Ser Thr Arg Ser Arg Arg Pro Pro Val 
580 585 590 

Asn Pro Ile Cys Ala Thr Asp Ala Val Arg His Arg Pro Ser Asn Arg
595 600 605

Met Ala Ala Trp Pro Ala Ala Ala Ala Gly 
610 615 

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1011 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1008

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG TCC GAT CAG CTG TTT GTC GCA TTT GAG AAG TAT TT? TTG ACG ACT 48
Met Ser Asp Gln Leu Phe Val Ala Phe Glu Lys Tyr Phe Leu Thr Ser
1 5 10 15

AAC GAG GTC AAG AAG AAT GCA GCA ACG GAG ACA TGG AC? TTT TCA TCG 96
Asn Glu Val Lys Lys Asn Ala Ala Thr Glu Thr Trp Thr Phe Ser Ser
20 25 30

TCC ATT TTC TTT GCC GTA ACC GTC GTC ACT ACC ATC GGA TAC GGT AAT 144
Ser Ile Phe Phe Ala Val Thr Val Val Thr Thr Ile Gly Tyr Gly Asn
35 40 45

CCA GTT CCA GTG ACA AAC ATT GGA CGG ATA TGG TCT ATA TTG TT? TCC 192
Pro Val Pro Val Thr Asn Ile Gly Arg Ile Trp Cys Ile Leu Phe Ser
50 55 60

TTG CTT GGA ATA CCT CTA ACA CTG GTT ACC ATC GCT GAC ??? ??? ??? 240
Leu Leu Gly Ile Pro Leu Thr Leu Val Thr Ile Ala Asp Leu Ala Gly
65 70 75 80

AAA TTC CTA TCT GAA CAT CTT GTT TGG TTG TAT GGA ??? ?AT ??? ??G 288
Lys Phe Leu Ser Glu His Leu Val Trp Leu Tyr Gly Asn Tyr Leu Lys
85 90 95

TTA AAA TAT CTC ATA TTG TCA CGA CAT CGA AAA GAA CGG AGA GAG CAC 336
Leu Lys Tyr Leu Ile Leu Ser Arg His Arg Lys Glu Arg Arg Glu His
100 105 110

GTT TGT GAG CAC TGT CAC AGT CAT GGA ATG GGG CAT GA? ATG ??? ??? 384
Val Cys Glu His Cys His Ser His Gly Met Gly His Asp Met Asn Ile
115 120 125

GAG GAG AAA AGA ATT CCT GCA TTC CTG GTA TTA GCT ATT CTG ATA GTA 432
Glu Glu Lys Arg Ile Pro Ala Phe Leu Val Leu Ala Ile Leu Ile Val
130 135 140

TAT ACA GCG TTT GGC GGT GTC CTA ATG TCA AAA TTA GAG CCG TGG TCT 480
Tyr Thr Ala Phe Gly Gly Val Leu Met Ser Lys Leu Glu Pro Trp Ser
145 150 155 160

TTC TTC ACT TCA TTC TAC TGG TCC TTC ATT ACA ATG ACT ACT GTC GGG 528
Phe Phe Thr Ser Phe Tyr Trp Ser Phe Ile Thr Met Thr Thr Val Gly
165 170 175

TTT GGC GAC TTG ATG CCC AGA AGG GAC GGA TAC ATG TAT ATC ATA TTG 576
Phe Gly Asp Leu Met Pro Arg Arg Asp Gly Tyr Met Tyr Ile Ile Leu
180 185 190

CTC TAT ATC ATT TTA GGT AAA TTT TCA ATG AAA AAA AAA CAA AAA TTC 624
Leu Tyr Ile Ile Leu Gly Lys Phe Ser Met Lys Lys Lys Gln Lys Phe
195 200 205

AAA ATA TTT TTA GGT CTT GCA ATA ACT ACA ATG TGC ATT GAT TTG GTA 672
Lys Ile Phe Leu Gly Leu Ala Ile Thr Thr Met Cys Ile Asp Leu Val
210 215 220

GGA GTA CAG TAT ATT CGA AAG ATT CAT TAT TTC GGA AGA AAA ATT CAA 720
Gly Val Gln Tyr Ile Arg Lys Ile His Tyr Phe Gly Arg Lys Ile Gln
225 230 235 240

GAC GCT AGA TCT GCA TTG GCG GTT GTA GGA GGA AAG GTA GTC CTT GTA 768
Asp Ala Arg Ser Ala Leu Ala Val Val Gly Gly Lys Val Val Leu Val
245 250 255

TCA GAA CTC TAC GCA AAT TTA ATG CAA AAG CGA GCT CGT AAC ATG TCC 816
Ser Glu Leu Tyr Ala Asn Leu Met Gln Lys Arg Ala Arg Asn Met Ser
260 265 270



CGA GAA GCT TTT ATA GTG GAG AAT CTC TAT GTT TCC AAA CAC ATC ATA 864
Arg Glu Ala Phe Ile Val Glu Asn Leu Tyr Val Ser Lys His Ile Ile
275 280 285

CCA TTC ATA CCA ACT GAT ATC CGA TGT ATT CGA TAT ATT GAT CAA ACT 912
Pro Phe Ile Pro Thr Asp Ile Arg Cys Ile Arg Tyr Ile Asp Gln Thr
290 295 300

GCC GAT GCT GCT ACC ATT TCC ACG TCA TCG TCT GCA ATT GAT ATG CAA 960
Ala Asp Ala Ala Thr Ile Ser Thr Ser Ser Ser Ala Ile Asp Met Gln
305 310 315 320

AGT TGT AGA TTT TGT CAT TCA AGA TAT TCT CTC AAT CGT GCA TTC AAA 1008
Ser Cys Arg Phe Cys His Ser Arg Tyr Ser Leu Asn Arg Ala Phe Lys
325 330 335

TAG 1011

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 336 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ser Asp Gln Leu Phe Val Ala Phe Glu Lys Tyr Phe Leu Thr Ser
1 5 10 15

Asn Glu Val Lys Lys Asn Ala Ala Thr Glu Thr Trp Thr Phe Ser Ser
20 25 30

Ser Ile Phe Phe Ala Val Thr Val Val Thr Thr Ile Gly Tyr Gly Asn
35 40 45

Pro Val Pro Val Thr Asn Ile Gly Arg Ile Trp Cys Ile Leu Phe Ser
50 55 60

Leu Leu Gly Ile Pro Leu Thr Leu Val Thr Ile Ala Asp Leu Ala Gly
65 70 75 80

Lys Phe Leu Ser Glu His Leu Val Trp Leu Tyr Gly Asn Tyr Leu Lys
85 90 95

Leu Lys Tyr Leu Ile Leu Ser Arg His Arg Lys Glu Arg Arg Glu His
100 105 110

Val Cys Glu His Cys His Ser His Gly Met Gly His Asp Met Asn Ile
115 120 125

Glu Glu Lys Arg Ile Pro Ala Phe Leu Val Leu Ala Ile Leu Ile Val
130 135 140

Tyr Thr Ala Phe Gly Gly Val Leu Met Ser Lys Leu Glu Pro Trp Ser
145 150 155 160

Phe Phe Thr Ser Phe Tyr Trp Ser Phe Ile Thr Met Thr Thr Val Gly
165 170 175

Phe Gly Asp Leu Met Pro Arg Arg Asp Gly Tyr Met Tyr Ile Ile Leu
180 185 190

Leu Tyr Ile Ile Leu Gly Lys Phe Ser Met Lys Lys Lys Gln Lys Phe
195 200 205

Lys Ile Phe Leu Gly Leu Ala Ile Thr Thr Met Cys Ile Asp Leu Val
210 215 220

Gly Val Gln Tyr Ile Arg Lys Ile His Tyr Phe Gly Arg Lys Ile Gln
225 230 235 240

Asp Ala Arg Ser Ala Leu Ala Val Val Gly Gly Lys Val Val Leu Val
245 250 255

Ser Glu Leu Tyr Ala Asn Leu Met Gln Lys Arg Ala Arg Asn Met Ser
260 265 270

Arg Glu Ala Phe Ile Val Glu Asn Leu Tyr Val Ser Lys His Ile Ile
275 280 285

Pro Phe Ile Pro Thr Asp Ile Arg Cys Ile Arg Tyr Ile Asp Gln Thr
290 295 300

Ala Asp Ala Ala Thr Ile Ser Thr Ser Ser Ser Ala Ile Asp Met Gln
305 310 315 320

Ser Cys Arg Phe Cys His Ser Arg Tyr Ser Leu Asn Arg Ala Phe Lys
325 330 335

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
TCCATTTTCT TTGCCGTAAC CGTCGTCACT ACCATCGGAT ACGGTAATCC A 51

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
TCATTCTACT GGTCCTTCAT TACAATGACT ACTGTCGGGT TTGGCGACTT G 
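
The 51-base oligonucleotides of SEQ ID NOS: 5 and 6 can be checked against the protein listings by translating them in frame. What follows is only a minimal Python sketch, assuming the standard genetic code and reading frame 1 (the listing does not name a translation table); the names `CODON_TABLE` and `translate` are illustrative and do not come from the application.

```python
from itertools import product

# Standard genetic code in TCAG order. Using this table is an assumption of
# the sketch; the sequence listing itself does not specify one.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring spaces and a trailing partial codon."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Sequences copied from the listing (grouped in tens, as printed).
SEQ_ID_5 = "TCCATTTTCT TTGCCGTAAC CGTCGTCACT ACCATCGGAT ACGGTAATCC A"
SEQ_ID_6 = "TCATTCTACT GGTCCTTCAT TACAATGACT ACTGTCGGGT TTGGCGACTT G"

print(translate(SEQ_ID_5))  # SIFFAVTVVTTIGYGNP
print(translate(SEQ_ID_6))  # SFYWSFITMTTVGFGDL
```

In one-letter code, the two translations match the first 17 residues of SEQ ID NOS: 19 and 21, respectively.
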
(2) INFORMATION FOR SEQ ID NO: 7:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Ala Phe Leu Phe Ser Ile Glu Thr Gln Thr Thr Ile Gly Tyr Gly Phe
1 5 10 15

Arg Cys Val Thr Asp Glu Cys Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Ala Phe Leu Phe Ser Leu Glu Thr Gln Val Thr Ile Gly Tyr Gly Phe
1 5 10 15

Arg Cys Val Thr Glu Gln Cys Ala
20 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Ala Phe Leu Phe Phe Ile Glu Thr Glu Ala Thr Ile Gly Tyr Gly Tyr
1 5 10 15

Arg Tyr H Thr Asp His Cys Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn 
1 5 10 15 



Ile Ser Pro Thr Thr Phe Ala Gly
20 

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ala Phe Trp Trp Ala Val Val Thr Met Thr Thr Val Gly Tyr Gly Asp
1 5 10 15

Met Thr Pro Val Gly Phe Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ala Phe Trp Tyr Thr Ile Val Thr Met Thr Thr Leu Gly Tyr Gly Asp
1 5 10 15

Met Val Pro Glu Thr Ile Ala Gly
20 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Ala Phe Trp Trp Ala Gly Ile Thr Met Thr Thr Val Gly Tyr Gly Asp
1 5 10 15

Ile Cys Pro Thr Thr Ala Leu Gly
20

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gly Leu Trp Trp Ala Leu Val Thr Met Thr Thr Val Gly Tyr Gly Asp 
1 5 10 15

Met Ala Pro Lys Thr Tyr Ile Gly
20 

(2) INFORMATION FOR SEQ ID NO : 15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala Leu Tyr Phe Thr Met Thr Cys Met Thr Ser Val Gly Phe Gly Asn 
1 5 10 15

Val Ala Ala Glu Thr Asp Asn Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Cys Val Tyr Phe Leu Ile Val Thr Met Ser Thr Val Gly Tyr Gly Asp
1 5 10 15

Val Tyr Cys Glu Thr Val Leu Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

Ser Leu Tyr Thr Ser Tyr Val Thr Thr Thr Thr Ile Gly Phe Gly Asp
1 5 10 15

Tyr Val Pro Thr Phe Gly Ala Asn
20 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn
1 5 10 15

Ile Ser Pro Thr Thr Phe Ala Gly
20 

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Ser Ile Phe Phe Ala Val Thr Val Val Thr Thr Ile Gly Tyr Gly Asn
1 5 10 15

Pro Val Pro Val Thr Asn Thr Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
Ser Leu Tyr Thr Ser Tyr Val Thr Thr Thr Thr Ile Gly Phe Gly Asp
1 5 10 15

Tyr Val Pro Thr Phe Gly Ala Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:




Ser Phe Tyr Trp Ser Phe Ile Thr Met Thr Thr Val Gly Phe Gly Asp
1 5 10 15

Leu Met Pro Arg Arg Asp Gly Tyr 
20 
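
The G-Y/F-G tripeptide that the description associates with potassium selectivity can be located in the 24-residue segments listed as SEQ ID NOS: 7-21. The snippet below is a hedged Python sketch of such a search, shown for SEQ ID NOS: 10 and 17 as printed in this listing; the helper names `to_one_letter` and `find_gyg_motif` are hypothetical and are not part of the application.

```python
import re

# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(residues: str) -> str:
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[r] for r in residues.split())


def find_gyg_motif(protein: str) -> list[int]:
    """Return 1-based start positions of each Gly-(Tyr or Phe)-Gly tripeptide."""
    # The lookahead lets overlapping occurrences be reported as well.
    return [m.start() + 1 for m in re.finditer(r"(?=G[YF]G)", protein)]


SEQ_ID_10 = to_one_letter(
    "Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn "
    "Ile Ser Pro Thr Thr Phe Ala Gly"
)
SEQ_ID_17 = to_one_letter(
    "Ser Leu Tyr Thr Ser Tyr Val Thr Thr Thr Thr Ile Gly Phe Gly Asp "
    "Tyr Val Pro Thr Phe Gly Ala Asn"
)

print(find_gyg_motif(SEQ_ID_10))  # [13]
print(find_gyg_motif(SEQ_ID_17))  # [13]
```

In both segments the tripeptide starts at residue 13: Gly-Tyr-Gly in SEQ ID NO: 10 and Gly-Phe-Gly in SEQ ID NO: 17.
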

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
ATAAAGCTTA AAAATGTCGC CGAATCGATG GAT 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AGCTCTAGAC CTCCATCTGG AAGCCCATGT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AAAAAGCTTA AAATGGCACA CATCACG 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
AAACTCGAGT CATACCTGTG GACT 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
AAAAAGCTTA AAATGGTCGG GCAATTG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
AAAAGCATGC TCATCTGGAT GGGCA 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
AAAAAGCTTA AAATGGCCTC GGTCGCC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH : 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
TTTTCTAGAC TACATCGTTG TCTT 24
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AAAAAGCTTA AAATGAATCT GATCAAC 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
AAATCTAGAT TAGTCGAAAC TGAA 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

AAAAAGCTTA AAATGCCTGG CGGA 

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
AAATCTAGAG GCTACAGGAA GTCC
(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
GGGGGTACCA AAATGTCGGG GTGTGAT
(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
TTTTTCTAGA TCAAGAGTTA TCATC 
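
Several of the oligonucleotides in SEQ ID NOS: 22-35 carry a six-base palindrome near the 5' end. The Python sketch below scans a primer for a small table of well-known restriction enzyme recognition sequences. The choice of enzymes is an assumption made for illustration, since this listing does not say which enzymes were used, and `RECOGNITION_SITES` and `find_sites` are hypothetical names.

```python
# Recognition sequences of some common restriction enzymes. The selection is
# an assumption of this sketch; the listing does not name any enzymes.
RECOGNITION_SITES = {
    "HindIII": "AAGCTT",
    "XbaI": "TCTAGA",
    "XhoI": "CTCGAG",
    "SphI": "GCATGC",
    "KpnI": "GGTACC",
}


def find_sites(primer: str) -> dict[str, list[int]]:
    """Map each enzyme with a hit to the 1-based start positions of its site."""
    seq = primer.replace(" ", "").upper()
    hits: dict[str, list[int]] = {}
    for enzyme, site in RECOGNITION_SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start + 1)
            start = seq.find(site, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


SEQ_ID_22 = "ATAAAGCTTA AAAATGTCGC CGAATCGATG GAT"
SEQ_ID_35 = "TTTTTCTAGA TCAAGAGTTA TCATC"

print(find_sites(SEQ_ID_22))  # {'HindIII': [4]}
print(find_sites(SEQ_ID_35))  # {'XbaI': [5]}
```

The bases that follow the site in SEQ ID NO: 22 (ATG TCG CCG AAT CGA TGG AT) match the start of the coding sequence in SEQ ID NO: 1.
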
(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

A.x Asp Asp Ala His Asx Asp Asp Ala Asx His Ala Asx His Ala Asp 

1 5 10 

Asx Ala Asp Asx His Asp His Ala Ala His Ala Asx Asp Ala Asx His 



20 25 
Asx Ala Asx His Ala His Ala Asp Asp Asp Ala Ala His Ala His His 

35 4° 4 

Ala Ala Asp Asx His Asp Asp His Ala Asx Asx Asp Ala Asp Asx His 

50 55 «° 

Asx Asp Asp Ala His Asx Asx Ala Asx His Ala Asp His Ala Ala Asx 
65 70 

Asp Asp Asx Asx Asp Asx Asx Ala Asp His Asp His Asp Asp His Asp 

85 *° 
Asp Ala Ala His His Asx Asp Asp Ala Ala Ala His Asp Asp His Ala 

Ala His Ala Ala His Asx Ala Ala Asx Asx Asp Ala His Asx Asp Ala 

115 «0 
Ala Asx Ala Asx Asx His Ala His Asp Asx Asx Asp His His Asp Ala 
130 «5 140 

.i „ -i _ » » » i - His His His Asx Asx Ala 
Asp Asx Ala Asp Asp Ala Ala Asp Asx AJ.a mis « 16Q 

145 150 *» 

Al. Asp Ala Asp Ala Asx Ala Ala Asp His Ala Asx Ala Ala His Ala 

165 170 
His His Asp His Asx His Ala Asx Hi- Asp Asp Asx Hi. Asx Asp Hi. 



180 



185 



A-x Ala His His ASX Hi. Hi. Asp Hi. Hi. Asp Asp Ala His His A«c 

195 200 
Asp Asp Ala Asp His Hi. Asx His His Asx His Asp Asx Asx Ala Hi. 

210 2« 220 

Asp Asp Ala Ala His Ala Hi. Asp Hi. Hi. ASX Ala Hi. Hi. Ala Ajx 
225 230 "S 

His Ala His Asx Asp Asx Asx Asx A-x His Asp Ala Asp His Ala Hi. 



245 



Hi- Ala His Hi- A. P His Asp Ala Al. Hi. Hi. Al. Hi. His Asp A.p 

260 265 
Hi. Hi. Asx Ala Ala Ala Asx Asx Asp A-p Ala His Asp Asp A-x A-p 



275 



280 



Hi. Asp A-p Ala Asp Ala Asx Ala His Asp Asp His Ala His His Asx 

290 295 
A.p Ala Ala Al. Asp Ala Ala His His Asx Ala His Asp Asp Asp Al. 
305 31° 

His A-x His Hi- Hi- Asp Al. Asx A.p Ala Hi. Hi. A-p A.p Asx His 



325 



Asx Ala Asx Ala Ala Asx His His Asx Asx Ala Ala Ala Asx Asp Asx
340 345 350

Ala Ala Asp Asx Ala His His Ala Ala His Asp His His His His Asx
355 360 365 

Ala Ala Asx Asx His Asp His His Asx Asx His Asx Ala His His Ala 
370 375 380 

His His Asp Asx His Asp Asp His Asx Asx Asx Asx His Asp Ala His 
385 390 39 s 400 

Asx His Ala Asx Asx Asp Asx Asp His Asx His His His Asp Asx His 
405 410 41S 

Asx Asx Asp Asp His His His Asp Asx His His asx Ala Ala Asx Ala 
420 425 

His Asx Asp His Asx Ala Ala Asx Asx His Asp Ala Asx Ala Ala His 
435 440 445 

Asx Ala His His Asx His Asx Ala His Asx Asx His Asx Asp His Asx 
450 455 460 

Ala His His Asp His His Asx His Asp Asp Ala Asp Asx Asx Asx Ala 
465 4 ?0 475 480 

Asx His Asp Ala Ala Asp Ala His His Asx Asx Ala His Asp His Asx 
485 49° *" 

Asx His Asp Asp His His His His His Asx His Asp Asp Asp His Ala 
500 505 510 

Ala Asx His Asx His His Asp Asp Ala Ala His Asp Asp Asx Asp Ala 
SIS 520 525 

Asx His His A*x Ala Ala His Asx Ala Ala Hi- His His His Ala Asp 
530 535 540 

Asx Ala Ala His Asp Asp Asp Asx His Ala Hi. Ala His Asp Ala Ala 
545 550 555 

Ala Ala Asx His Asx Asp Asp His His His Ala His Asp Asp Ala Asp 

555 570 3/3 

His His Asp Asp His Asp Asp Asx Asp Ala His His His His Asx Asx 

560 585 * 9V 

Asp Asx Ala Asx Ala Asx Asx His Ala Asx Ala His His Asp Asp Asx 
595 600 605 

Asp Asx His Asx His Asx His His Asp Ala Hi- His Asp Asp Ala Ala 
610 615 620 

Ala Asx Ala Ala Asx Ala His His His Asp Asx Asp Asp Ala His His 
625 «30 635 

Asp Asx His Asp Ala His Ala Ala Asx Asp Asp His His Asp His Asp 

Ala Ala Ala Ala His Asx Asp Asp Ala Asp His Asp Ala Asx Asx His 
660 665 670 

Ala His His His His Asx His Asp Ala Ala His Asp Ala His Asp Ala 
675 680 685 

Asp Asx Asx His Ala Ala Ala Asx His His Asp His His Asp Asx Ala 
690 695 7 00 

Ala His Asx Asp His Asx His Ala His His His Asx Asp Asp Asx Ala
705 710 715 720

His Ala His Asx Asp His His Asp Asp His Asp Ala His Asx Asx His

725 730 
His Asx His Asp Asp His Asp His Asp His Asp ASX Ala Ala His His 



725 7 30 

His Asp His 
740 745 
Asp Asx Ala Asx His His His His Asx His His His Ala His Asx Ala 



755 7 «° 
Asx Ala Ala Ala Asp Asx Ala Ala Asp Ala His His His Asx His Ala 

770 775 780 

Asx Asx Ala Asx His Ala His Asx Ala Asx Asx Ala His Asx Ala Ala 
785 790 7 " 

Ala Ala Ala Asp Asp Ala Ala His Asp Asp Ala Ala Ala His His Asx 



805 



A-p asx Asp Ala Ala Ala Ala Asp Asp Asx Asp Asp Ala Ala Ala Asx 



820 



Asx Asp Ala Asx Ala A.p Ala Asx Asx Asp His Asx His Asx Asx Ala 

835 840 
His Afix Asx Ala His His Asx His His His Asp Asp Ala Asx Asx Ala 



850 855 



Asx Ala His His Asx Ala Asx Ala Ala Ala Asx His Asp His His Ala 
865 « 70 875 

His Asp Asp Asp Asx Ala Ala Asx His Asx His His Asx Ala Ala Hi. 

885 890 

Asp His His His Asp Asp His His Asx His Asp A.x Hi. His His A.p 

900 905 
Asx Asx A.p His His Ala Asx His Asx His A-x Ala Asx Ala Ala Hi. 

ASX His Hi. A.X ASX ASX HIS A.p Hi- His Ala Hi. A»p Al. His A. P 

930 93S 
Ala A-x Asx Asp His Hi. Ala Asx Asx Ala A-x Hi. A.x A.p His A.p 

945 950 " D 

Asp Ala Asp Ala His His A-x A-x Asp A.p Asx Hi. Hi. A.x Asx Hi. 



965 



Ala Ala Ala Asx Ala Ala Ala Ala His His Ala His Asp His Asx His 

980 985 
Asp Ala Ala Al. Ala A.x A-p Ala^Hi. A.p Ala Ala Al^His A.x Hi. 

Ala Asx Ala Asx Hi. Hi. Hi. A.p Asx His A-x Ala Asx Ala Ala A-p 

1010 10" 1020 

Hi- Hi- His ASX ASX His A-X Hi- His A.X His Hi. A-X Ala Ala His^ 
1025 1030 

Hi. Hi. A.p Hi. Hi. ASX Asp Asx Hi- A.p A-X A-p Ala Hi. HiS^A-p 

1045 105U 

A-p Ala Hi. Asx Asx Ala His Ala Asp Hi. Hi. Asp Asx Hi^His A.x 

1060 1° 65 
Asx Ala Ala Asp Ala His His Asx Ala Asx His Asp Asp Asx Asx Asp
1075 1080 1085

Ala Asx Ala Asx Asx Asx Asx Asp His His Ala Asx Asx His Asx Ala

1090 1095 1100 

Ala Ala Hia His Hia Asp Asx Aax Ala His Ala Ala Hia Aax His His 
1105 1110 1115 1120 

Aap Asx Asp His Asp Asx His Aax Hia Hia His His Asx Ala Hia His 
H25 113 0 1135 

Asx Asx Ala His His Asx Hia Hia Asx His Hia Aax His Hia Asx His 
1140 H45 II 50 

Asp Asx Ala Ala Aax His Ala His Asx Asp His Asp His Aax Asx Ala 
1155 1160 H fi5 

Aap Ala Asx Aap Asx Asp His Asp Asx His Hia Ala Hia Asx Asx His 
1170 1175 H80 

Asp His Hia His Hia Asx His His Hia Aap Ala Asp Hia Aax Hia Ala 
1185 1190 11»S "DO 

Aax Hia Aap Ala Asx Ala Hia Hia His Hia Hia Aap Hia Aap Ala His 
1205 1210 1215 

His Asp Aap His Aap Asp Ala Ala Hia Hia Asp Aax Aax Ala His Aap 

1220 1225 "SO 

Hia Aax Hia Hia Hia Hia His Asx Ala Aax Ala Hie Asp Asp Ala Hia 
1235 1240 "45 

Ala Aax Aax His Aax Ala Asp Asx Asp Aax Hia Asx His Aap Aap Aax 
1250 1255 12«0 

Ala Ala His Asp Asp Asp Ala His Ala Aax Ala Aax His Asx Aax Ala 
126S 1270 1275 1280 

Ala Ala Aax Aap Hia Aax Aap Hia Asp Aax Aax Ala Hia Aax Hia Aax 

1285 1290 1395 

Ala Aax Hia Ala Aax His Asx Ala Ala Aap Ala Hia Hia Hia Aap Aax 

1300 1305 131° 

Aax Aap Aax Hia Aax Ala Aap Aax His Hia Hia Aax Aax Aap Hia Hia 

1315 1320 "25 

His Aap Aax Ala Aax Hia Aax Hia Hia Ala Hia Aap Aap Hia Hia Aap 
1330 1335 1340 

Aap Aax Aax Hia His Asx His Aax Ala Aax Aax Aap Aap Hia Aap Aap 

1345 1350 13S5 

Aax Aax Hia Aap Hia Aap Aap Aax Aax Aax Aap Hia Hia Aap Hia Hia 

1365 1370 137S 

Ala Hia Hia Aap Ala Aap Aax Ala Aax His Hia Aax Aap Hia Aap Aap 
1380 13*5 13 »0 

Ala Aax Ala Ala Aap Aax Aax Ala Ala Aap Hia Ala Hia Aax His Hi. 
1395 1400 1405 

Ala Hia Ala Ala Ala Hia Ala Hia Hia Hia Ala Hia Ala Aap Aax Ala 
1410 1415 1420 

Hia Hia Ala Aap Ala Aap Hia Ala Hia Ala Aax Hia Hia Aap Hia Hia 
1425 1430 1435 

Ala His Ala His Asp His His Asp His His His His His Ala His His
1445 1450 1455

Ala Ala Asp Asx His Asp His Asp Asp Ala Ala His Ala Ala Ala Ala
1460 1465 

His Ala Ala His His Ala His His Ala Ala Ala Ala Ala Ala Ala Ala 
1475 1480 "85 

Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Asp Asx Asp 
1490 1495 1500 

Asx Asx Asp Asx His Asx Asp Ala Asp Asx Ala His His Asx Ala His 
1505 1510 "IS 

Asp Asp Ala Asp Ala Asp Ala Ala Ala 

1525 

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 479 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Met Ser Pro Asn Arg Trp Ile Leu Leu Leu Ile Phe Tyr Ile Ser Tyr
1 5 10 15

Leu Met Phe Gly Ala Ala Ile Tyr Tyr His Ile Glu His Gly Glu Glu
20 25 30

Lys Ile Ser Arg Ala Glu Gln Arg Lys Ala Gln Ile Ala Ile Asn Glu
35 40 45

Tyr Leu Leu Glu Glu Leu Gly Asp Lys Asn Thr Thr Thr Gln Asp Glu
50 55 60

Ile Leu Gln Arg Ile Ser Asp Tyr Cys Asp Lys Pro Val Thr Leu Pro
65 70 75 80

Pro Thr Tyr Asp Asp Thr Pro Tyr Thr Trp Thr Phe Tyr His Ala Phe
85 90 95

Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn Ile Ser
100 105 110

Pro Thr Thr Phe Ala Gly Arg Met Ile Met Ile Ala Tyr Ser Val Ile
115 120 125

Gly Ile Pro Val Asn Gly Ile Leu Phe Ala Gly Leu Gly Glu Tyr Phe
130 135 140

Gly Arg Thr Phe Glu Ala Ile Tyr Arg Arg Tyr Lys Lys Tyr Lys Met
145 150 155 160

Ser Thr Asp Met His Tyr Val Pro Pro Gln Leu Gly Leu Ile Thr Thr
165 170 175

Val Val Ile Ala Leu Ile Pro Gly Ile Ala Leu Phe Leu Val Leu Pro
180 185 190

Cys Val Gly Val His Leu Leu Arg Glu Leu Gly Leu Ser Ser Ile Ser
195 200 205

Leu Tyr Tyr Ser Tyr Val Thr Ile Thr Thr Ile Gly Phe Gly Asp Tyr
210 215 220

Val Pro Thr Phe Gly Ala Asn Gln Pro Lys Glu Phe Gly Gly Trp Phe
225 230 235 240

Val Val Tyr Gln Ile Phe Val Ile Val Trp Phe Ile Phe Ser Leu Gly
245 250 255

Tyr Leu Val Met Ile Met Thr Phe Ile Thr Arg Gly Leu Gln Ser Lys
260 265 270

Lys Leu Ala Tyr Leu Glu Gln Gln Leu Ser Ser Asn Leu Lys Ala Thr
275 280 285

Gln Asn Arg Ile Trp Ser Gly Val Thr Lys Asp Val Gly Tyr Leu Arg
290 295 300

Arg Met Leu Asn Glu Leu Tyr Ile Leu Lys Val Lys Pro Val Tyr Thr
305 310 315 320

Asp Val Asp Ile Ala Tyr Thr Leu Pro Arg Ser Asn Ser Pro Leu Ser
325 330 335

Met Tyr Arg Val Glu Pro Ala Pro Ile Pro Ser Arg Lys Arg Ala Phe
340 345 350

Ser Val Cys Ala Asp Met Val Gly Ala Gln Arg Glu Ala Gly Met Val
355 360 365

His Ala Asn Ser Asp Thr Asp Leu Thr Lys Leu Asp Arg Glu Lys Thr
370 375 380

Phe Glu Thr Ala Glu Ala Tyr His Gln Thr Thr Asp Leu Leu Ala Lys
385 390 395 400

Val Val Asn Ala Leu Ala Thr Val Lys Pro Pro Pro Ala Leu Gln Glu
405 410 415

Asp Ala Ala Leu Tyr Gly Gly Tyr His Gly Phe Ser Asp Ser Gln Ile
420 425 430

Leu Ala Ser Glu Trp Ser Phe Ser Thr Val Asn Glu Phe Thr Ser Pro
435 440 445

Arg Arg Pro Arg Ala Arg Ala Cys Ser Asp Phe Asn Leu Glu Ala Pro
450 455 460

Arg Trp Gln Ser Glu Arg Pro Leu Arg Ser Ser His Asn Glu Trp
465 470 475
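
A listing in three-letter code can be checked against its stated LENGTH by converting it to one-letter code and counting residues. The following is a minimal sketch, not part of the application as filed; the function name `to_one_letter` is hypothetical, and the table is the standard IUPAC three-letter to one-letter mapping. A misread code (for example `He` in place of `Ile`, a common scanning error) raises an error instead of being silently dropped.

```python
# Standard IUPAC three-letter to one-letter amino acid codes,
# including the ambiguity codes Asx, Glx and Xaa.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Asx": "B", "Glx": "Z", "Xaa": "X",
}


def to_one_letter(listing: str) -> str:
    """Convert a three-letter sequence listing to one-letter code.

    Purely numeric tokens (position markers) are skipped; any other
    unrecognized token raises ValueError so garbled codes are noticed.
    """
    residues = []
    for token in listing.split():
        if token.isdigit():
            continue  # position number printed under the sequence
        if token not in THREE_TO_ONE:
            raise ValueError(f"unrecognized residue code: {token!r}")
        residues.append(THREE_TO_ONE[token])
    return "".join(residues)


# First row of the listing above, with its position line.
first_row = (
    "Met Ser Pro Asn Arg Trp Ile Leu Leu Leu Ile Phe Tyr Ile Ser Tyr\n"
    "1 5 10 15"
)
print(to_one_letter(first_row))       # MSPNRWILLLIFYISY
print(len(to_one_letter(first_row)))  # 16
```

Applied to a complete listing, `len(to_one_letter(...))` can be compared with the LENGTH field of the sequence characteristics.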

(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 335 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

Met Ser Asp Gln Leu Phe Val Ala Phe Glu Lys Tyr Phe Leu Thr Ser
1 5 10 15

Asn Glu Val Lys Lys Asn Ala Ala Thr Glu Thr Trp Thr Phe Ser Ser
20 25 30

Ser Ile Phe Phe Ala Val Thr Val Val Thr Thr Ile Gly Tyr Gly Asn
35 40 45

Pro Val Pro Val Thr Asn Ile Gly Arg Ile Trp ue .eu ,ne «r
50 55 60

Ile Gly Ile Pro Leu Thr Leu Val Thr Ile Ala Leu Ala Gly Lys Phe
65 70 75 80

Leu Ser Glu His Leu Val Trp Leu Tyr Gly Asn Tyr Leu Lys Leu Lys
85 90 95

Tyr Leu Ile Leu Ser Arg His Arg Lys Glu Arg Arg Glu His Val Cys
100 105 110

Glu His Cys His Ser His Gly Met Gly His Asp Met Asn Ile Glu Glu
115 120 125

Uys Arg Ile Pro Ala Phe Leu Val Leu Ala Ile Leu Ile Val Tyr Thr
130 135 140

Ala Phe Gly Gly Val Leu Met Ser Lys Leu Glu Pro Trp Ser Phe Phe
145 150 155 160

Thr Ser Phe Tyr Trp Ser Phe Ile Thr Met Thr Thr Val Gly Phe Gly
165 170 175

Asp Leu Met Pro Arg Arg Asp Gly Tyr Met Tyr Ile Ile Leu Leu Tyr
180 185 190

Ile Ile Leu Gly Lys Phe Ser Met Lys Lys Lys Gln Lys Phe Lys Ile
195 200 205

Phe Leu Gly Leu Ala Ile Thr Thr Met Cys Ile Asp Leu Val Gly Val
210 215 220

Gln Tyr Ile Arg Lys Ile His Tyr Phe Gly Arg Lys Ile Gln Asp Ala
225 230 235 240

Arg Ser Ala Leu Ala Val Val Gly Gly Lys Val Val Leu Val Ser Glu
245 250 255

Leu Tyr Ala Asn Leu Met Gln Lys Arg Ala Arg Asn Met Ser Arg Glu
260 265 270

Ala Phe Ile Val Glu Asn Leu Tyr Val Ser Lys His Ile Ile Pro Phe
275 280 285

Ile Pro Thr Asp Ile Arg Cys Ile Arg Tyr Ile Asp Gln Thr Ala Asp
290 295 300

Ala Ala Thr Ile Ser Thr Ser Ser Ser Ala Ile Asp Met Gln Ser Cys
305 310 315 320

Arg Phe Cys His Ser Arg Tyr Ser Leu Asn Arg Ala Phe Lys Xaa
325 330 335

What is claimed is:

1. A potassium channel comprising four hydrophobic
domains capable of forming transmembrane helices,
wherein

(i) a first pore-forming domain is interposed
between a first and a second transmembrane
helix; and

(ii) a second pore-forming domain is interposed
between a third and a fourth transmembrane
helix.

2. The potassium channel of claim 1 wherein each
pore-forming domain comprises a potassium selective peptide
motif.

3. The potassium channel of claim 2 wherein the peptide
motif is selected from the group consisting of a Y/F-G
dipeptide motif and a G-Y/F-G tripeptide motif.
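
The motifs named in claim 3 can be located in a one-letter sequence with a simple pattern search. The sketch below is illustrative only and is not the applicants' method; the function names and the `offset` parameter are hypothetical. The two test fragments are our reading of residues 97-112 and 209-224 of SEQ ID NO: 37 in one-letter code.

```python
import re

# G-Y/F-G tripeptide and Y/F-G dipeptide, written as lookaheads so that
# overlapping occurrences are all reported.
TRIPEPTIDE = re.compile(r"(?=(G[YF]G))")
DIPEPTIDE = re.compile(r"(?=([YF]G))")


def find_tripeptide_motifs(seq: str, offset: int = 1):
    """Return (position, motif) pairs for every G-Y/F-G in seq.

    `offset` is the residue number of the first character of seq.
    """
    return [(m.start() + offset, m.group(1)) for m in TRIPEPTIDE.finditer(seq)]


def find_dipeptide_motifs(seq: str, offset: int = 1):
    """Return (position, motif) pairs for every Y/F-G in seq."""
    return [(m.start() + offset, m.group(1)) for m in DIPEPTIDE.finditer(seq)]


# Fragments read from SEQ ID NO: 37 (assumed one-letter transcription).
first_fragment = "FFAFTVCSTVGYGNIS"    # residues 97-112
second_fragment = "LYYSYVTITTIGFGDY"   # residues 209-224

print(find_tripeptide_motifs(first_fragment, offset=97))    # [(107, 'GYG')]
print(find_tripeptide_motifs(second_fragment, offset=209))  # [(220, 'GFG')]
```

On this reading, each fragment carries one tripeptide motif, in keeping with the two pore-forming domains recited in claim 1.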

4. The potassium channel of claim 3 wherein at least one
pore-forming domain is positioned proximal to an exterior 
portion of a cell membrane. 

5. The potassium channel of claim 4 further comprising an
amino-terminal glycosylation site.

6. The potassium channel of claim 5 wherein said
glycosylation site is asparagine-linked. 

7. The potassium channel of claim 6 characterized in that 
it belongs to a class of invertebrates. 

8. The potassium channel of claim 7 characterized in that 
it is insect-derived.

9. The potassium channel of claim 7 characterized in that 
it is nematode-derived. 

10. An isolated nucleotide sequence capable of encoding
DmORF-1.

11. The isolated nucleotide sequence of Claim 10 comprising 
the nucleotide sequence depicted in Seq. I.D. No. 1. 

12. An isolated nucleotide sequence capable of encoding
CORK.

13. The isolated nucleotide sequence of Claim 12 encoding 
for the protein depicted in Sequence I.D. No. 36. 

14. An expression vector capable of expressing a
heterologous potassium channel in a cell membrane of a 
yeast cell comprising the nucleotide sequence of Claim 10. 

15. An expression vector capable of expressing a 
heterologous potassium channel in a cell membrane of a
yeast cell comprising the nucleotide sequence of Claim 11. 

16. An expression vector capable of expressing a 
heterologous potassium channel in a cell membrane of a 
yeast cell comprising the nucleotide sequence of Claim 12. 

17. An expression vector capable of expressing a 
heterologous potassium channel in a cell membrane of a 
yeast cell wherein the potassium channel comprises the 
amino acid sequence of Claim 13.

18. A transformed yeast cell comprising the nucleotide
sequences of Claims 10, 11, 12 or 13.

19. A transformed yeast cell comprising the expression 
vector of claims 14, 15, 16 or 17. 

20. A method of assaying substances to determine effects on 
cell growth, the method comprising the steps of: 

a. preparing cultures of yeast cells in liquid medium
lacking uracil, the liquid medium consisting of a
concentration of KCl adequate to support growth of
potassium-dependent mutant strains;

b. plating the yeast cells in uracil-free agar
medium, the agar medium consisting of sufficient
KCl to selectively support growth of
potassium-dependent mutant strains containing a heterologous
potassium channel of claim 1;

c. applying substances to the agar plate; 

d. incubating the agar plate to permit growth; and 

e. identifying zones of growth around the substances, 
wherein the level of growth indicates whether or 
not activity of the heterologous potassium channel 
has been modulated as compared to control.
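
Step e compares growth around each applied substance with a control. A minimal sketch of that comparison follows; it is not taken from the application, and the function name, the numeric growth measure, and the 10% tolerance are all assumptions made for illustration.

```python
def classify_growth(zone_growth: float, control_growth: float,
                    tolerance: float = 0.1) -> str:
    """Compare growth around a substance with growth of the control.

    Growth values are any positive measurement on a common scale
    (hypothetical here, e.g. zone diameter or colony density).
    `tolerance` is an assumed fractional band treated as no change.
    """
    if control_growth <= 0:
        raise ValueError("control growth must be positive")
    ratio = zone_growth / control_growth
    if ratio > 1 + tolerance:
        return "increased"
    if ratio < 1 - tolerance:
        return "decreased"
    return "unchanged"


# Hypothetical readings, for illustration only.
print(classify_growth(1.5, 1.0))   # increased
print(classify_growth(0.5, 1.0))   # decreased
print(classify_growth(1.05, 1.0))  # unchanged
```

A result other than "unchanged" flags the substance as a candidate modulator of the heterologous channel; whether the change reflects activation or inhibition is addressed in claims 22 and 23.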

21. The yeast cell of Claim 20 further comprising a
nucleotide sequence encoding RAK, or a nucleotide sequence 
of Claim 10, 11, 12 or 13. 

22. The method of claim 20, wherein said effect on cell 
growth is modulated by activation of the potassium channel.

23. The method of claim 20, wherein said effect on cell 
growth is modulated by inhibition of said potassium 
channel.

24. A method of selectively inhibiting insect pests by 
applying to such insect pests a substance capable of 
inhibiting a potassium channel substantially homologous to 
that encoded by the nucleotide sequence of claim 10. 

25. A method of selectively inhibiting nematode pests by 
applying to such pests a substance capable of inhibiting a 
potassium channel substantially homologous to that encoded 
by the nucleotide sequence of claim 12.

26. A method of modulating the activity of a potassium 
channel positioned in a cellular membrane and comprising an 
amino acid sequence selected from the group consisting of 
SEQ ID NO: 2, SEQ ID NO: 4, and SEQ ID NO: 36, by 
contacting said cellular membrane with a substance, in an 
amount and for a period of time sufficient to inhibit the 
ability of potassium ions to pass through said channel. 

FIG. 1. Panel labels: SC galactose, 100 mM KCl; SC galactose, 0 mM KCl; SC glucose, 0 mM KCl; SC glucose, 100 mM KCl.